^^^^^^-'•420Reo'dPcfTO 29 NOV 1999 

,n J of ih- -i"""'"«'me ends 

■ »,.nalofeuka™o,icc*mi»dismbuK<lonUnearchromoson^.Theen<bof 
^ gen«,c of euka^ ^^^^^ ^^^^ 

.^ese hereditary — are termed «lomeres. ^^^^^ ^ 

(pan or Mos. — J-^^ J^, . .lere sequences of 

„,a,edorg.n,.ms are of» s^Ur ^d ^ ^ 

which are more phyloienelically remole. 

TTAnnn in all the vertebrates which nave so im 
constructed from the sequence TTAGGG m all 

examined (Meyne et al, 1989). 

« a variety of important functions. They ptevent the fusion of 
— ; 7^,, ^ the fonnation of dicentnc her^tary 

Chromosomes ^^^ 'Zr^ .^n .ead to the development 

units. Chromosomes of this nature, t. 

„, eancer due to loss of hetem^ygosity or the duplication or loss of genes. 

»rve the purpose of disdnguishing intact hereditary units from 
In addition, telomeres serve the purpos haibouted a 

Tt^K veast cells ceased dividing when they hatoour 
damaged hereditary units. Thus, yeast 

chromosome which lacked a telomere (Sandell and Zakian, 1993). 

. „.ni lask in association with DN A replication in eukaryotic 
Telome,escarryouta„otherimpo,.ant,askin»o ^ 

cells. In contrast to the circular genomes of prokaryot^, 

cukary^es cannot'l. completely replicated hy the DNA P"'^'— '^^^^^ ^ 
^ required for iniUating DNA replicati^. After the RNA pnmers have l«u hmi 
re^a^a. fragments haveheen extended and then ligated. the --^2. 
,^d lacks the . end hecause the RKA primer at that P°™' ^^^^^ 

♦ho.,, .necial protective mechanisms, the chromosomes would snn 
For this reason, without special prote 199O). The non-coding telomere 

every cell division ("end-replicauon problem , Barley al.. 



,„ .reventtag the loss of genes (San-le" 

^..nces p^baW, ^P^"' ^ 



Zakian, 1993). 



,» *y an important n,te in regulating cell ageing 
Ov. »d above this, — ^'^mlibi. a lin^ted capacity » .plicate in cnltute; 
(Olovnikov. .973). Hutnan somanc eel ^^^^^^ ^ ^ 

a certain time they become senescent. 

heing s— ated with ^wth factors; J ,« a cell 

determines from the lengm oi 

„ thA aopine of cells and in 
^ the telomeres consequently possess central functions m the agemg 
In summary, the telome .^^ ^^^^^ 

the stabihzation of the genetic material an p 



.^tViPdyes thetelgmgsi 
Theenzymetel^ 



15 



120 



• . Linear chromosomes ai^ only able to replicate 
AS described above, organisms posses^ ^ ^^^^ ^^^^^^^ M,,, 

,Heir genomes incompletely in the a ^^^^^^^^^ ^^^^^^^ 

eukaryotes use a special enzyme, i.e. te ^^.^^ ^^^^ 

„se IS expressed constitu.vely in .^^^^^^^^^ J ^^^^^ ... 

cammed. By contrast, in humans, ^^^^J^ ^^^^^ „e (Kim . 
tumour cells whereas neighbouring somatic tissued 



1994). 



25 



30 



TelomerasejtLcili^tes 

.ae„,if,ed for .he firs, rime in .he ciliate r«r*-" 
Uy. .he *n«res, .elomerase was ^^^^^ ^ .„g,e-stra^ 

.Hemopm. Telomerase activty was (Oreider and Blackburn. 

„„g„„„cleo.ide dCrrOGOG). in dre presence ^ 

. .is „achon, 0. ..r^y^ 7.g.ar .elorr^ sequence of 
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10 



15 



■,H .he « sequence of r.,™).^- (O-i-^' 
e^enae. pHn,er w* .e «lo^ 

1985). From these results, it was cone 
the sequence of the telomeres. 

. of an RNA componen. in *e ■e.ome.se ini«a.,y been den»ns«ea 
once me e..enc of an R«A c , ^ ^^^^ „^ , 

(aeide, and Blackburn, 1987), *e gen ^ ^^^^ ,^ 

!..„ irmidet and B ackbum, 198V). inis lu 

complementary to the Tetrakymm component, as »as 

* °' t r ri^rr™ : s:L,uent .o. . act... . 

— " H^A » mutated, the cot^spondln, mutatlo. 

complementary region ot the leio fVu a/ 1990). Telomerase 

. • • i«m the Tetrahymena telomeres (Yu ei ai-, 
were incorporated in vivo into tne letru y 

were mcorp rn A-dependent DN A polymerases, 

consequently belongs to the class of RNA dep 

t.lnmerase i e. p80 and p95, were identified in 

.cf.tp..eln.— o.- 

«s the RNA component led to he foU w g ^^^^^ 

v.r. tn the sinde-stranded 3 overndiig. 
complementary tegton to the sing ^^^^ „, 

„tlng the corresponding ^ _,^^,„„ o„ce a telomere 

„ P^bably iuvoives^n ~ J ^^^^.^ _ PNA nnUl it is 

— - been ^V"-'^;^ * ^ _ sequence. Uis mode, does not 
„«e again in a posttton to be able « P ^ ^^^^^ ^ 

- .0 be gener^ly ..id -^- -^ add. .fore ,t dissociates 
species with regard to the number of nucleoua 

25 from the telomere (Prowse et al, 1993). 

...... xnr:-:— - 

u •» ^i'?'^ fxhibits a basic domain at us i^i 
The telomerase subunit pl23 exhibits 

■ t n?n at the C terminus, suggesting this protem nas 
reverse transcriptase (RT) at me ^ 
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nl 23 has been reported to share significant homology 
O^ngner . ... 19., — '^^^^^^^^^ ^ ...lad (Un.er . 

with the Saccharomyces cerevmae protein list 



a/., 1997). 



c . t hitherto been demonstrated to possess any function v,hich is 
Whereas p80 ^d p95 have not httherto be^^ ^^^^^^^^^ ^^^^ 

,een unambiguously shown to ha e l.y^^^^^ 
RT led to significant truncation of the telomeres y 



JO Tplnmerase c 



25 



omEonsitdEOIIUIl^^ 



, of .he — es of vartous organise, Wer alia of 
cer^isiae. m,ce a»i huma,^s (S g ^^^^ ^ ^ 

primary sequence of the human ,,,^i,iae On the other hand, regions 

components of .he chafes or of — — 
e.,s, which a. oons^ea he^een human and ™™e .e>o„«ase RNA (F 

• . A nrntein (hTPl) has recentiy been described 
^ i.«o„ of a human 1 „ pSO 

a^„g,on « al.. .997). On «-e has,s of - — ^ „„, ,„ ,he 

.„hunu, *e co„es^n*g gene was found ,n an EST aa.a b« 

. ^1 IQQ?^ hTPl is composed of 2bli anaw 
genera, pubhc (Han,ng.on « 1997> ^.^ ^ 

N-feminus, exhWts .h.ee 6oma,ns wh,ch possess ^ 

,1 *^;r.tnnhan and asparagine, whicn prci>uuia j 
„pca,s of *e amino ac,ds '^^ ' J . . addi^ona, sm,c- efemen.. in 
prolein/ptotein infraction, were shown to be present. 



the C-terminal region. 
ActiyationoOhe^^ 



30 



Le A 32 486-Foreign countries 




In humans, it was originally only possible to demonstrate telomerase activity in germ line 
cells and not in normal somatic cells (Hastie et al, 1990; Kim et al, 1994). After a more 
sensitive detection method had been developed (Kim et a/., 1994) a low level of telomerase 
activity was also detected in hematopoietic cells (Broccoli et aL, 1995; Counter et ai, 1995; 
Hiyama et aL, 1995). However, these cells nevertheless exhibited a reduction in the telomeres 
(Vaziri et ai, 1994; Counter et ah, 1995). It has still not been clarified whether the quantity of 
enzyme in these cells is insufficient to compensate for the telomere loss or whether the 
measured telomerase activity stems from a subpopulation, e.g. of incompletely differentiated 
CD34"^38'*" precursor cells (Hiyama et aL, 1995). In order to clarify this point, it would be 
necessary to detect the telomerase activity which was present in a single cell. 

Interestingly enough, however, significant telomerase activity has been detected in a large 
number of the tumour tissues which have been tested to date (1734/2031, 85%; Shay, 1997), 
whereas no activity has been found in normal somatic tissue (1/196, <1%, Shay, 1997). In 
addition, a variety of investigations demonstrated that the telomeres continued to shrink in 
senescent cells which were transformed with viral oncoproteins and that it was only possible 
to find telomerase in the subpopulation which survived the growth crisis (Counter et aL, 
1992). The telomeres were also stable in these inmiortalized cells (Counter et aL, 1992). 
Similar findings derived from investigations in mice (Blasco et aL, 1996) support the 
assumption that reactivation of the telomerase is a late event in tumorigenesis. 

Based on these results, a "telomerase hypothesis" was developed which links the loss of 
telomere sequences and cell ageing to telomerase activity and the genesis of cancer. In long- 
lived species such as humans, the shrinking of the telomeres can be regarded as a tumour 
suppression mechanism. Differentiated cells, which do not contain any telomerase, cease 
dividing when the telomeres have reached a particular length. If such a cell mutates, a tumour 
can only develop from it if the cell is able to extend its telomeres. Otherwise, the cell would 
continue to lose telomere sequences until its chromosomes became unstable and it finally 
died. Reactivation of the telomerase is presumably the main mechanism which tumour cells 
deploy in order to stabilize their telomeres. 
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„ follows from ftesc otaervaUons and , to « it should be possible .0 develop a topy for 
,„ based on inhibiting telon^rase activity. Conventional cancer thetapies us,ng 
cytostatic agents or short-wave irradiation dantage all the dividing cells in dte body ,n 
addition to dan^sng the tnmour cells. However, since it is only ge,™ Hne cells whtch 
contain significant telomerase activity, apart from tumour cells, telomerase inhibitors would 
attack the tumour cells more specifcally and consequently evoke fewer undesirable s,de 
effects Since telomerase activity has been detected in all Ute tumour tissues tested to date. ,t 
would be possible to employ these therapeutic agents against all types of cancer. The effect of 
telomerase inhibitor, would then set in when the telomers of the cells had shortened to such 
an extent that the genome had become unstable Since tmnour cells usually exhibit shorter 
telomeres than do nonnal somatic cells, it would be cancer cells which would first of all be 
eliminated by telomerase inhibitors. By contrast, cells possessing long telomeres, such as the 
gem cells, would not be damaged unttl a much later stage. Telomemse inhrbttors 
consequently,epresentanappn>achwh,ch points the wayforward for cancertherapy. 

However, ,t will only be possible to ptovide unambiguous answer, to questions regarding me 
nature and the points of attack of physiological telomerase inhibitors when the protem 
snncmres of the enzyme, together with flteir functions, have also been identified and a deeper 
understanding of the vafious telomere-binding proteins has been obtained. 

ne invention relates to the catalytically acUve human telomerase subunit (phTC). where 
appropHate ,n purified form, to active moieties of the protem. to modulator., in pamcular 
agonists of the protein, to substances which imrtate the function of the protein and to 

combinations of these components. 

The invention furthermore relates to: 



The nucleic acid sequence 



which encodes the human protein phTC, specifically: 




30 ( - the genomic sequence o fthe hTC gene^ 

the cDNA sequence of the hTC gene, 



*"'^^""rir;i-— ----- 

^ , . N f WTC which is shown in Fig. 1 • 

(SEQ ID No. I) of hTC wnicn 

Hi.h encode hTC-homologous proteins in other 
The nucleic acid sequences wh.ch encode 

manmals, specificaUy. 

.hegenomicsequencesofhTC-homologous genes, 

. the sequences of the mKiNA!> 
genes, 

. parts of the abovementioned sequences. 

.hich in hutnans and other — , encode proteins which 
Nucleic acid sequences which, in 

are related to the phTC protein, specifically: 



.f hTC-related genes in humans and other mammals, 
the genomic sequences of hTC re ^ ™s and other mammals, 

e u-rr related genes in humans ana oui<^ 
thecDNAsequencesofhlC-re ^hed from hTC-related genes in 

the mRNAs which are transcribed trom 
the sequences of the nuuNr. 

humans and other mammals, 

parts of the abovementioned sequences. 

. HTC protein which is isolated from mammalian cells (cf. 
The above-described phTC protem. 

Fig. 2 and SEQ ID No. 2). 

. . • labelled with a detection reagent, with the detection 
The PhTC p«>tein which is labelled w ^^^^^^^ ^ 

.agent preferably being an enzyme, a radioactive y 

chemical. 

u- K ic directed against the phTC protein. 
An antibody which is directea ag 



.cco*g .0 a p^fened e^bodi^en. .Ms anUbCy ,s a pCycl- anybody. 
^.„*,.oa„o,.=rp.rer.ae™..™"...Msa„,.bc.y,sa— a„.^,. 

, can be pmduced, for example, by mjecti^g a host, which is 

^„,,^es of *,s na»« ^ ^ P ^ „^ ^ ,„^,„, 

:r r::z:r ----- '-'"^ 

this antibody. 

■ nnalized cell line which produces monoclonal antibodies can be 
In addition, an immortalized ceu hik^ 

obtained in a manner known per se. 

Where appropriate, the antibodies can be labelled with a detection reagent. 

employed instead of the complete antibody. 

, f c.,.h a detection reagent are enzymes, radioactively labelled 
Prefeited examples of such a detection g 

elements, fluorescent chemicals or biotin. 

■ form which have a sequence which is identical or exactly 

Oligonucleotides m purified form J ^^^^^^ 

complementary to a contiguous sequence, of from 10 
the above-described genomic DNA, cDNA or mRNA. 

^„ Oligonucleotide of this nature can. in par^cular, be . oligodeoxy-.bonucleotide 
or an oligoribonucleotide or a peptide nucleic acid (PNA). 

„ is ,ven to oligonucleotides which inhibit, repress or bloc, the activity of 
the telomerase when they bind to the hTC mRNA. 
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A DNA sequ^A a degenemte variadon of «s sequence, »hich encodes .he 
phTC ■m.tein, or a trLen. of .his p,o.ein, where appropriau con^pnsing *e DNA 
sequence in Figure la,\a DNA sequence which hybriCzes wi.h .he previously ci.ed 
DNA sequence under s.an\ard hybtidizadon conditions. 

A recombinanKDNA molecule which comprises a DNA sequence, or a degenen,.e 
variarion of .his Vence, which cnc^es phTC or a fragn^n. of phTC, wi* .he la„er 

sequence preferablVp** " """"r 

a DNA sequence whSvhybridizes wid, .he previously ci.ed DNA sequence under 

Standard hybridization coi^tions. 

in .he abovemeMioned recombinant DNA molecule, .he described DNA is preferably 
linked to an expression connol sequence. 

Examples of expression control sequences which are pariicularly preferred are the 
early or promorer of the SV40 virus or adenovirus, the lac system, *e np sys.em, 
.he TAC sys.em, the TOC system, .he main operaror and promoKr regions of phage 
.he contrel regions of the fd coat protein, the 3-phosphoglyce,a,e kinase promoter, 
U,e acid phospha.ase promoter and .he yeas, a-mating factor promoBr. 

A single-cell host which has been «ansfonned with the above-described recombinant 
DNA molecule which comprises .he DNA sequence, or a degenerate variation of th,s 
sequence, which encodes U,e phTC protein or a part of this protein. In d„s 
recombinan. DNA molecule, *e said DNA sequence is linked to an express,on 
control sequence. 

n • i« Vinct are- E coU Pseudomonos, Bacillus, 
Preferred examples of the single-cell host are. t. con, r 

tc rwo Rl 1 B-W L-M, COS 1, COS 7, BSCl, BSC40 and 
Streptomyces, yeasts, CHO, Kl.i, d w, 1 1, 

BNTTIO cells, plant cells, insect cells and mammalian cells in cell culture. 
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molecule, or a derivative or fagment of .his molecule. 

^ method for inhi«„. ~ acdvi., in hu™» cells, p«fe«bly .eoplasuc 
l: i„ Which an exogenous P«,ynuc,eo.ide *h co„sis.s of a uanscnphon un. 
*,e^ in.o .he cells. Ws ™scrip.ic. uni. comprises a po,yn„c,eo.,de se^ue ce 
7T1 ^ consecu.ive nucleohdes, which sequence is suhs,a„ria„ i«.c. or 

to the hTC RNA sequence and is linked to a 
substantially complementary to the hTC kina q - ..nnfthe 

iot;r,o ^Pnuence which controls the transcnption ot the 
heterologous transcription-regulaUng sequence wnicn 

linked polynucleotide in the said cells. 

P^ferably. *e ahovem..ioned he.ro,o,ous .r.scrip.io„-.gu,a.ing sequence 
comprises a promoier which is consri.u.ively aciive in human cells. 

AUemanvel, .he he.ero,ogous .ranscrip.ion-.8Ula.ing sequence can comprise a 
r: Which can be induced or reposed in human cells h, adding a 
Lhsrance. Examples of such promote, a. inducible and 
aependen. p.o™,.ers. hea. shoe, protncers and meta, ion-dependen. promorets. 

^ abovemenrioned exogenous Hynucieo.ide c., for example, he a vir. genome 
con.a,„ing a „anscnp.io„ un,t f«>m the human hTC DNA compK-nent. 

Panicularly preferabiy, .he sa.d ascription uni. produces anrisense RNA which is 
subs-aliy completneniary .o *e human hTC RNA componen.. 

Par^cular p« is also given .o .he exogenous poiyaucleotide being able .o 
comprise the sequence in Fig. 1 a. 

A pol^ucle^ide for .he generic *erapy of a human disease, litis polynucie.-* 

eonsis^of a »anserip.ion uni. which comprises a P.>^...eoride 

, consecuitve nucleorides. which se,ue„ce is subsianriaUy idenltcal or subs^nually 
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^Hp«„„..gu.a*g sequence which ccTois .he of .he hnRed 

polynucleotide in said cells. 

A «U,od fo, de.ec.ng ,elome^-assoeia.ed condicion. in a pa.ien., which meftod 

comprises the following steps: 

A. Dceciing .he phTC pro.ein i. fluids o, ceh samples in orfer .o ob.ain a 

diagnostic value; . . 

B CompaHng .he diagnosUc vaine wid. s.andard value, for ,he phTC p™«.n ,n 
standardized normal cells or body fluids of .he same .ype as *e .est sample; 

C. Detecting dia^ostic values which ^ higher or lower ,ha„ the stan*rd 
comparative values and which indtcate a telomerase-associated cond,„on, whtch 
condition in turn indicates a pathogenic condition. 

Th,s method is p«ferably employed for detecting a neoplashc dtsease in a patient. 
The method then comprises die following steps; 

A Detecting the phTC protdn in cell samples in order to obtain a diapostic value; 
b: Companng the diagnostic v.ue with sundard values for the phTC protetn m 

non-neoplastic cells of the same type as the test sample; 
C. Diapostic values which a« clearly htgher than standard comparative values 

indicate a neoplastic condition. 

A method for detenmning the p^sence of the phTC protein in a cell or cell satnpU^ 
which method is bas«i on amphf^ng an hTC polynucleotide, or hybridizing an hTC 
polynucleotide, a primer or an hTC-complementar, sequence wrth », 

polynucleotide. 

A test Idt for detecting phTC in cell samples ^d body fluids, with it being possible. 
f„ example, for labelled, immonochemically-reactive components to be: polyclonal 
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an^^es a,.„s, phTC, —a. — ag- phTC, fra^n. of ^ 
antibodies or a mixture of these components. 

A method for pteventing and/or treating ceil disturbance or ae.™ct,on and/or 
IZction anL odter symptoms in hnmans. which method .based on 
:l:ste.ng a therapeuticah, e«ec.e of cata,.ica,„ acnvc um» 

teiomerase. its functional equivalents or its catalyticaliy active fragments ., ts 

to concve of using a su.ta„ce which promotes the pro^tc«o„ - 
Lvity Of phTC a substance which can imitate the ^ of ph^. — 
which can inhibit .he p,«iuction and/or actmty of phTC, or 
substances. A specific binding partner can also be employed. 

The method is preferably employed for p^venting or toting agetng or cancer 



diseases. 



substances whtch at. able to affect the activity of phTC, ..e. — 7 
be„ temted modulators. Such mc.*tors can be found, in a — ^ 
testing thcr effect on telomerase activUy in a telomerasc assay. Examples 
lelomerase assays are given in Example 15. 

Modulators of PhTC a. of .... fo. .ea.ng .seases w.c. a. .o^^ 

may, in particular, be mentioned in this context. 

■ . .K^ uTC TTiRNA which nucleic acid comprises a 
An antisense nucleic acid against the hTC mRNA, wnicn 
::i"Uuencewhicbhyb„..swiththesa.dmRNA.».h.heanUsensenucle,c 

acid being an RNA or a DNA. 

.i^ir acid binds to the start codon of the particular mRNA. 
Preferably, the antisense nucleic acid Dinas 
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A recombinant DNA molecule which conunn, a DNA sequence from wh,ch an 
antiscnse ribonucleic acid against the hTC mRNA is produced during transcnptron. 
TOs said antisense ribonucleic acid comprises a nucleic acid sequence wh,ch can 

hybridize to the said hTC mRNA. 

A DNA molecule of .his namre can be used to p«P- a eel, line having a reduced 
exp^sion of phTC by ,r»sfec.,ng a phTCproducing ceil line with thrs recombmant 
DNA molecule. 

A ribozyme which cleaves the hTC mRNA. 

Thrs ribozyme is preferably a Te,raKy.e.a.y^ nbozyme or a han^erhead-type 

ribozyme. 

A recombinant DNA molecule which contams a DNA sequence whose transcription 
leads to the production of a ribozyme of this nature. 



This recombinant DNA 



molecule can be used to transfect a phTC-producing cell line. 



A combination which consists of a pair of huma. hTC polynucleotrde PCR pnmer.. 
with the primers p»ferably consisting of sequences whrch cor^sfond to the s^uence 
of the human hTC mRNA or which are complementary to this sequence. 

A combination which comprises a polynucleotide hybridization probe for the human 
hTC gene, with the probe preferably comprising at leas, 29 consecutive nucieotrdes 
which correspond to the sequence of the human hT€ gene or whrch a., 
complementary to this sequence. 

Animal models which can be used to investigate telomerase/telomere regulation « 
Wvo. Thus, tumour development and age^g can, for example, be drrectly mvesugated 
using knockout animals or transgenic animals. 
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^ „e case of p™.in. or peptides, *.nc.o„a, ^.valen. a„ .hose compounds wMcH wM,e 



functions. 



30 



isoenzymes or so-called niicroheterogeneities in 
Known examples of these compounds are isoenzymes 



proteins. 



to ,he case of *e oHgonucleic or pol^ucleie acids, funcucna, e,uiva,e„,s are .o be 
lllas ... J co..u„ds *H .ffer . -eo.ide se,..ce 

t • The existence of such compounds may be attributed, for example, to the 
the same protem. The existence ui »u 

fact that the genetic code is degenerate. 



Explanation of the figures: 



. of the catalytic subunit of human telomerase (hTC) 
Fig. 1: cDNA sequence ot tne cawiyu^ 

(SEQ ID No. 1). 

K„. 2: An.no acid sequence -ch is deduced f,on> .he hTC DNA sequence depiced in 
Pi, 1 (SEQ ID NO. 2). ^^^^^^ ^ 

The DNA sequence depicted in rig. i ^ r 
Position 64 to Position 34^ into an amino acid sequence. The amino acid residues 
aie depicted in accordance with their single-letter code. 




25 Fig. 3: 



E*idiu™ bro™.de-s..ned agan,se ge, containing AA28.296 DNA which has been 

treated in different ways. ^;ff^,^nt 
^ f,g^ Shows an chidinn. bro^de-siained 0.8% agarose gei. Two d,«eren. 
ON A Size s-dards are loaded in ianes 1 and 8. wi* .he DNA fragmen. iengms 3^ 

i 0.5 and 0.4 Kb being poinied ou.. T.e AA28>296 DNA in pT^D was ^ges. 
a resuiCon en.^ Bco RI/No. ■ O^e 3). Ps, . flane 5> and Xho . aneT, 

Undiges«d AA281296 DNA in pT7T3D was loaded on.o ,ane 2. 1/10 of a PC 
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mixture (1 minute 94°C, 2 minutes at 60°C, 3 minutes at 72°C) with the hTC 
cDNA in pT7T3D and primers 1 (5' GAGTGTGTACGTC- 
GTCGAGCTGCTCAGGTC 3') and 4 

(5' CACCCTCGAGGTGAGACGCTCGGCC 3') [lane 4] and, especially, with 
primers 6 (5' GCTCGTAGTTGAGCACGCTGAACAGTG 3') and 7 
(5' GCCAAGTTCCTGCACTGGCTGATGAG 3') [lane 5] was applied to lanes 4 
and 5. 



Fig. 4: Detail from a comparison of the protein sequences of the Euplotes pl23 (pl23) 
and human (phTC) catalytic telomerase subunits. 

The conditions (ktuple, gap penalty and gap length penalty) are listed for the 
Lipman-Pearson protein comparison, using the Lasergene program software 
(Dnastar, Inc.), which is depicted in this figure. The amino acid residues are 
depicted in accordance with their single-letter code. The amino acids which are 
identical between Euplotes aediculatus pi 23 and the identified EST+i are also 
highlighted using the corresponding letter from the single-letter code. Amino acids 
which are not identical but whose function is similar or comparable are marked by 
a :. 



Fig. 5: Part of a comparison of the protein sequences of the catalytic telomerase subunits 
of Euplotes pl23 (pl23), and yeast (est2p). 

The condition (Ktuple, gap penalty and gap length penalty) are listed for the 
Lipman-Pearson protein comparison using Lasergene program software (Dnastar, 
Inc.) which is dipicted in this figure. The amino acid residues are shown in 
accordance with their single letter code. The amino acids which are identical 
between Euplotes aediculatus pl23 and yeast est2p are likewise given prominence 
by the corresponding letter from the single-letter code. Amino acids which are not 
identical, but which are similar or comparable in function, are marked with a :. 



Fig. 6: Detail from a comparison of the protein sequences of the yeast (est2p) and human 
(phTC) catalytic telomerase subunits. 
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The conditions (ktuple, gap penalty and gap length penalty) are listed for the 
Lipman-Pearson protein comparison, using the Lasergene program software 
(Dnastar, Inc.), which is depicted in this figure. The amino acid residues are 
depicted in accordance with their single-letter code. The amino acids which are 
identical between yeast est2p and the identified EST+i are also highlighted using 
the corresponding letter from the single-letter code. Amino acids which are not 
identical but whose function is similar or comparable are marked by a :. 

Fig. 7: Detail from a comparison of the protein sequences of the Euplotes pi 23 (pi 23), 
yeast (est2p) and human (phTC) catalytic telomerase subunits. The comparison, 
depicted in Fig. 5, between Euplotes pl23 (pl23), yeast (est2p) and humans 
(phTC) was carried out using the Clustal Method subprogram of the Lasergene 
program software (Dnastar, Inc.) under standard conditions. The amino acid 
residues are depicted in accordance with their single-letter code. The amino acids 
which are identical between yeast est2p, Euplotes aediculatus pi 23 and the 
identified EST+i are also highlighted using the corresponding letter from the 
single-letter code. In addition, the regions which are identical between all three 
proteins are marked by a light grey bar above the protein sequence. 

Fig. 8: Generated DNA sequence from Example 6 (RACE round 1) (SEQ ID No. 3). 

Fig. 9: Generated DNA sequence from Example 6 (RACE round 2) (SEQ ID No. 4). 

Fig. 10: Generated DNA sequence from Example 6 (RACE round 3) (SEQ ID No. 5). 

Fig. 11: Generated DNA sequence from Example 8 (RACE round 3) (SEQ ID No. 6). 

Fig. 12: Outline of the cloning of the complete hTC cDNA sequence. The positions of the 
start and stop codons are marked by arrows. The black regions of the rectangles 
symbolize protein-encoding sequence sections, whereas the pale grey regions 
symbolize 5 - and 3 - untranslated cDNA regions and/or denote intronsequences. 
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additional homologous amino acids. The telomerase-specific motif is described in 
Example 9. 

Fig. 14: Generated DNA sequence from Example 1 1 (3' version) (SEQ ID No. 7). The 
region which is not homologous with the DNA sequence depicted in Fig. 1 is made 
to stand out in bold. 

Fig. 15: hTC expression in cancer cell lines and normal human tissue. Fig. A: 
Approximately 2 |ig of poly-A"^ RNA from different human cell lines were 
immobilized on the Northern blot in accordance with the manufacturer's 
(Clontech) instruction. Specifically, the RNA originated from a melanoma (G361), 
a lung carcinoma (A549), an adenocarcinoma of the colon (SW480), from a Raji 
Burkitt's lymphoma, from a leukaemia cell line (MOLT-4), from a chronic 
leukaemia cell line (K-562), from a cervical tumour (HeLa) and from the 
leukaemia cell line HL60. The transcripts marked 4.4 kb, 6 kb and 9.5 kb . are 
specific for hTC (compare Example 10). Fig. B: About 2 )ig of poly-A*^ RNA from 
different human tissues were immobilized on the Northern blot in accordance with 
the manufacturer's (Clontech) instructions. Specifically, the RNA was isolated 
from heart, brain, placenta, lung, liver, skeletal muscle, kidney and pancreas. An 
RNA size standard is shown. 

Fig. 16: Western blot analysis of the rabbit sera against peptides from the human 
telomerase amino acid sequence (Example 12). In each case, 20 ^1 of the bacterial 
lysates from Example 13 were analysed in a western blot {Ausubcl et aL, 1987) 
using the antisera from Example 12. Lysates from bacteria which harbour the 
pMALEST construct were loaded in lanes 1, 2, 6 and 7. Lysates from bacteria 
which harbour the pMALAl construct were loaded in lanes 3, 4, 8 and 9. Lysates 
from bacteria which were not induced with IPTG (isopropyl-beta- 
thiogalactopyranoside) were loaded in lanes 1, 3, 6 and 8. Lysates from IPTG- 
induced bacteria were loaded in lanes 2, 4, 7 and 9. A standard size marker 
(10 kDa protein ladder from Life Technologies, Cat. No. 10064-012) was loaded 
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in lane 5. The 50kDa and 120 kDa bands are marked at the edges of the 
membranes. The PVDF membrane in Fig, A containing lanes 1 to 4 was incubated 
with preimmune sera against peptide B (compare Example 12). The PVDF 
membrane in Fig. B containing lanes 6 to 9 was incubated with preimmune sera 
against peptide C (compare Example 12). The PVDF membrane in Fig. B 
containing lanes 1 to 4 was incubated with immune sera against peptide B 
(compare Example 12). The PVDF membrane in Fig. B containing lanes 6 to 9 
was incubated with immune sera against peptide C (compare Example 12). 

Fig. 17: Autoradiogram of ^^S-labelled, in vjYrc»-translated protein. The complete in vitro- 
translated hTC (compare Example 15) was loaded in lane 1. A C-terminally 
truncated version of phTC was loaded in lane 2. Lane 3 shows a positive control 
for the in vitro translation which was supplied by the manufacturer (compare 
Example 15). A protein size standard for estimating protein sizes is marked on the 
right-hand side. 

Fig. 18: Autoradiogram of P-labelled products from the TRAP assay (compare 
Example 15). A TRAP assay mixture without any added enzyme or protein was 
loaded, as a negative control, in lanes 1 and 2. A TRAP assay mixture containing 
partially purified human telomerase from HeLa cells was loaded, as a positive 
control, in lanes 3 and 4. A TRAP assay mixture containing in vi7r(?-translated 
phTC was loaded, undiluted, in lanes 5 and 6. A TRAP assay mixture containing 
in vzYro-translated phTC, at a 1:4 dilution, was loaded in lanes 7 and 8. A TRAP 
assay mixture containing in v/rr<7-translated phTC, at a dilution of 1:16, was loaded 
in lanes 9 and 10. A TRAP assay mixture containing in v/rrc?-translated luciferase 
was loaded, as a negative control, in lanes 1 1 and 12. 

Fig. 19: Autoradiogram of P-labelled products from the direct telomerase assay (compare 
Example 15). A radioactively labelled 10 bp marker was loaded in lane 1. A 
telomer oligonucleotide ([TTAGGGJa) which was radioactively labelled 5' was 
loaded in lane 2. Lane 3 is an empty lane. Partially purified human telomerase 
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ft„„ HcU ceHs was used in a to, ass., and -he s,.*esis produc was loaded, 
as a pcmve control, in ,ane 4. V. /» v,-.»-™s,a». phTC ftom Example 15 was 
used in a direct assay and the syntltesis product was loaded ,n lane 5. 
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F.vamples 



Example 1 

. H that less than 5% of the human genome is in fact transcribed and 

It is nowadays accepted that less tnan , it k nossible 

■ p„.n before the genome has been completely sequenced, it is possible 
translated into protem. Even before the geno 

• . t information about the 60,000-70,000 genes m a human cell oy 
to obtain important information .^^ 

^^ina moieties of the genome in a specific manner, n'c 
investigating these coding moieties of g ^^^^ .^^^ 

high-throughput DNA sequencing technology m the last 10 to O y 

hign inroug H ^.^^^^^ ^^^^ sequence 

to collect many cDNAs from plasmidcDN A libranesot y ..,^30010 

. These short DNA sequences, which are typically of from 3UU 

the S' or 3' end in each case. 1 nese snou 1^1 m 

the 5 or J ciiu . compiled in 

400 bp in leng*. « .e^d ,.p.ssed ^uence ,ags EST. or an J 

. . u ... Th« EST approach was initially descnlied by Okubo 
.anous .peciali^ da.a ^^^^J^^ ^, „ ^, appro.n,a«,y 

"*^> ^ "-^'"^ '° " Ta tiled and doc— ed as EST entries. 
50,000 liuman cell genes are partially sequencea an 

. .he DNA and anuno acid sequences ot known genes, it .s possible ,0 

By companng w,tb the DNA ^^^^^^ 

,ae„,.y Elated, bnt hithe^o *wa^. e ,n * s^^ ^^^^^ 
1996). tBLASTn (Altschul er a/., 1990) is a search aig 

„..,.r.hispn^.™salgo*.r.slatese,e.DNAclone,n^^^^^^^^^^ 
s,x possible reading francs »d con,pa„» these amino aetd seqt^nces wtth the tao 



sequence. 



T. P^T data base at the National Center for Btotechnology Informatics, (NCBI) was 

Pobltshed P..n sequence for the .... — -.c 

1 Thi<; resulted in a human coi vvit* 
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The homology between pi 23 and the Est+i is most conspicuous in two sequence regions 
which are separated by 30 amino acids. The longer sequence region, which in pl23 extends 
from amino acid 438 to amino acid 484, is 38% identical to the corresponding region Est+i- If 
similar amino acids are also taken into consideration, the congruence is even 59%. The 
second block of homology extends, in the pl23 protein, from amino acid 513 to amino acid 
530 and exhibits 44% identity with the corresponding sequence segment in the identified 
Est+i. A congruence of 61% is obtained when amino acid residues having similar properties 
are taken into account. 

The P (probability) value is an important parameter for assessing a BLAST search. P 
indicates the probability of also finding a specific segment pair in a BLAST search using a 
random sequence and varies numerically between 0 (highly significant result) and 1 
(insignificant result). Thus, comparison of the pi 23 equivalent from yeast (est2p) with the 
NCBI EST data base, for example, gave a negative result: The EST which was found had a 
probability of P=l (Tab. 1). On the other hand, human telomerase-associated protein 1 
(hTPl), which was found in an EST data base which is not available to the general public 
(Harrington et ai, 1997), gives a probability of P=0.004. 



known gene (species) 


P 


identified gene 


origin of the cDNA 
library 


est2p (Saccharomyces 
cerevisiae 


0.999 


Rat EST 


Kidney 


p80 {Tetrahymena 
termophilia) 


0.004 


hTPl (Harrington et al, 
1997) 


Crypts of the intestinal 
epithelium 


pi 23 {Euplotes 
aediculatus) 




AA281296 


Germinal centres of the 
tonsils 



Tab. 1 : Comparison of three tBlastn search runs using different known genes. 
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hu^an EST AA2S.2,6 wWch was «ir,ed .he c„™p^» - P'^ ^ 



25 



probability of P=3.5x 10"*. 



. . A v<iT in all orobability encodes a fragment of the 

suh- of hu.an ~7^„^^ ^^^^ , „,,.„3.e<, .o phT.. 

below to hTC (human Telomerase, catalytic) and V 



Example 2 
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vector pT7T3D-Pac. 

3S9 bp Which had been fed into *e EST da.ba. were 
pn„er supplied by Aurer^ham (DNA sequence, see 1 Posttion 16S5 to 2073). 

„ pro^ software (Dnastar Inc., was used to translate the DNA sequence ^ 

Lasergenc piu^m rpsultins amino acid 

ESTAA281296 in accordance with the human genetic code. The resulting 

sequence (EsUO corresponds to Position 542 to 670 in Fig. 2. 

f F=t k composed of 129 amino acids, including 27 basic, 
The deduced protein sequence of EsU, is composea 

1 1 acidic, 5 1 hydrophobic and 28 polar amino acid residues. 
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The EST (AA281296) which was identified in Example 1 was obtained commercially from 
Research Genetics, Inc. (Huntsville) in the form of a plasmid transformed into E. coli and 
analyse experimentally: 

As shown in the ethidium bromide-stained agarose gel depicted in Fig. 3, a fragment from 
EST AA281296 of approximately 2.2 kb in size is liberated from the vector pT7T3D after 
subjecting the prepared plasmid DNA to restriction digestion. With the aid of a polymerase 
chain reaction (PGR), which was carried out in parallel and which made use of specific 
internal primers, EST AA28 1296 was inspected: the lengths of the expected PGR products 
are 325 and 380 bp and are in agreement with the lengths of the fragments which were found 
experimentally (cf tracks 4 and 5 in Fig. 3). This therefore demonstrated that the E.coli clone 
supplied by Research Genetics, Int. (Huntsville) therefore harbours the identified EST as a 
plasmid. 

After the DNA had been prepared, the 2176 bp of the insert in total were identified by means 
of double-strand sequencing. A comparison of the DNA sequences of clone AA281296 and 
of the C5F fragment (compare Example 7) showed that there was a 182 bp deletion (Positions 
2352 to 2533, Fig. 1) and that the open reading frame is consequently displaced in this region. 
In summary, the DNA sequence of clone AA281296 is composed of the sequence 
information shown in Fig, 1 (Positions 1685 to 2351 and Positions 2534 to 4042). 

Example 3 

The tBLASTn comparison only identifies the regions in which there is the greatest agreement 
between pl23 and Est+i (amino acids 438-530, in pl23), whereas the intervening amino acids 
are not taken into account. A Lipman-Pearson protein comparison was carried out in order to 
be able to draw conclusions about the relatedness of the protein sequences over a larger 
region (amino acids 437-554, in pl23) (see Fig. 4). When this was done, 34% of the amino 
acids were found to be identical while 59% of the amino acids were found to be either 
identical or biochemically similar. This results demonstrates that the relatedness of these 
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proteins also continues outside the regions of homology which were found using the 
tBLASTn program. 

As has recently been reported (Lingner et al., 1997), Euplotes aediculatus pi 23 and 
Saccharomyces cerevisiae est2p are homologous to each other. In order to relate the degree of 
affinity between pi 23 and est2p to the homology between pi 23 and EsL+i which is described 
here, the Lipman-Pearson protein comparison was employed to compare the above-described 
region of pl23 (amino acids 437-554) with est2p, too, using identical parameters. This 
showed that, in this chosen region, pi 23 and est2p are 21% identical and that 22% of their 
amino acid residues are either identical or biochemically similar (see Fig. 5). Accordingly, the 
homology between Est+i and Euplotes pl23 is significantly higher than between pl23 and 
est2p. 

Example 4 

The homology of pi 23 with Est+i and est2p suggests that all 3 proteins belong to the same 
protein family. In order to confirm this assumption, est2p was compared with Est+j under the 
conditions described in Example 3 (see Fig. 6). This showed that Est+i is 20% identical to 
est2p, that is exhibits a degree of homology which is comparable to that of pl23 to est2p. 
This comparatively low level of congruence also confirms the finding that no significant EST 
was identified in the tBLASTn search using est2p (see Example 1). 

Example 5 

A computer comparison using pl23, est2p and phTC was carried out in order to identify 
possibly functional domains which are important for the protein family consisting of catalytic 
telomerase subunits derived from different species (see Fig. 7). In this analysis, two regions 
which are present in all three proteins are particularly conspicuous (see Fig. 7). At present, no 
unambiguous function can be assigned to the region which, in pi 23, corresponds to amino 
acids 447 to 460 (Fig. 13, telomerase motif). A motif search using the Genetics Computer 
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Group (GCG) Wisconsin Sequence Analysis Package and a search in a protein data base 
(Swissprot, version of 8.6.1997) did not provide any significant insights. 

On the other hand, a second region which is homologous between pl23, est2p and phTC, 
corresponding in pi 23 to amino acids 512-526, exhibits a consensus motif for a reverse 
transcriptase (RT) (Figs. 7 and 13). Lingner et ai, (1997) showed that pl23/est2p contain a 
total of 6 such RT motifs, which are essential for the catalytic function of pl23/est2p. As 
depicted in Figs. 7 and 13, two such RT motifs are also conserved in the sequence of phTC 
which has been investigated. These motifs are the RT motifs which are located to the furthest 
extent N-terminally in pl23/est2p (Lingner et al,, 1997). 

The primary sequences of reverse transcriptases are strongly divergent; only a few amino 
acids are fully conserved within a separate motif (Poch et al, 1989 and Xiong and Eickbush, 
1990). In addition, due to having different distances between the conserved RT motifs, 
reverse transcriptases which are encoded by retroviruses or long terminal repeat (LTR) 
retroposons differ from those reverse transcriptases which are encoded by non-LTR 
retroposons or group n introns (Xiong and Eickbush, 1990). Based on the structure of their 
RT motifs, pl23, est2p and phTC are to be assigned to the latter RT group. Interestingly, in 
this context, the consensus sequences of the RT motifs in phTC correspond most closely to 
the postulated RT consensus motif: of eight amino acid residues within the two RT motifs, 6 
are present in the case of phTC while only 5 are present in the case of pl23 and esp2p (Figs. 7 
and 13). It is striking in this context how the hydrophobic amino acids, such as leucine and 
isoleucine, and the amino acids lysine and arginine, in particular, are in specific positions 
(Figs. 7 and 13). 

In summary, it was hereby possible to demonstrate, at the descriptive level, that the 
AA281296 clone, identified due to its homology with pi 23, is a fragment of the catalytic 
subunit of human telomerase. 
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Example 6 

For cloning the 5' end of the hTC-cDNA, three consecutive RACE (rapid amplification of 
cDNA ends) reactions were carried out in addition to the homology screening described in 
Example 8. Marathon-Ready cDNA (Clontech) form the human leukaemia cell line K562 or 
from human testis tissue was employed as the cDNA source. The implementation of the 
individual RACE rounds, as well as the results obtained, are described below. 

In addition to this, the sequence information obtained in the RACE rounds was used in order 
to amplify the individual fragments from a contiguous cDNA clone by means of PCR. 

RACE round 1: 

In a final volume of 50 ^1, 10 pmol of dNTP-mix were added to 5 ^1 of K562 Marathon- 
Ready cDNA (from Clontech, Catalogue Number 7441-1), and a PCR reaction was carried 
out in 1 X Klen Taq PCR reaction buffer and 1 x advantage Klen Taq polymerase mix (from 
Clontec). 10 pmol of the internal gene-specific primer GSP2 
(5' -GCAACTTGCTCCAGACACTTCTTCCGG-3') from the 5' region of the hTC-EST 
clone and 10 pmol of the Marathon Adaptor primer API 
(5'-CCATCCTAATACGACTCACTATAGGGC-3'; from Clontech) were added as primers. 
The PCR was carried out in 4 steps. After a one-minute denaturation at 94^C, denaturation 
was then carried out for 5 cycles of 30 sec at 94°C and the primers were then subsequently 
annealed for 4 min at 72°C and the DNA chain was extended. There then followed 5 cycles in 
which the DNA was denatured for 30 sec at 94'^C but the subsequent primer extension took 
place for 4 min at 70°C. Finally, 22 cycles were then carried out in which, after the 30 sec 
DNA denaturation, the primer annealing and chain extension took place for 4 min at 68°C. 

Following this PCR, the PCR product was diluted 1:50. 5 |il of this dilution were used in a 
second "nested" PCR together with 10 pmol of dNTP-mix in 1x10 Klen Taq PCR reaction 
buffer and 1 x Advantage Klen Taq polymerase mix and also 10 pmol of primer GSP2 and 
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lOpmol of the "nested" Marathon Adaptor primer AP2 
(5'-ACTCACTATAGGGCTCGAGCGGC-3'; from Clontech). The PGR conditions 
corresponded to the parameters selected in the first PGR. As the only exception, only 
16 cycles were chosen, instead of 22 cycles, in the last PGR step. 

A DNA fragment of 1153 bp in length was obtained as the product of this nested RACE 
PGR. This fragment was cloned into the TA cloning vector pGR2.1 from Invitrogen and 
subjected to complete double-strand sequencing (Fig. 8 and SEQ JD No. 3). 

Nucleotides 974 to 1153 represent the nucleotide region 1629 to 1808 of the hTC-cDNA 
which is depicted in Fig. 1. The nucleotide region extending from bp 1 to bp 973, which does 
not exhibit any homology with the hTG-cDNA sequence shown in Fig. 1, represents intron 
sequences of the hTG gene (data not shown). A 3' splice consensus sequence is located at the 
exon-intron transition. The presence of intron sequences could be due to using incompletely 
spliced mRNA as the starting substance for the cDNA synthesis. Genomic DNA 
contamination in the cDNA could also be an explanation for intron sequences being found. 

RACE round 2: 

Based on the sequence data obtained in the first RACE round, a second RACE was carried 
out using the gene-specific primer GSP5 from the 5' region of RACE product 1 
(5'-GGCAGTGACCAGGAGGCAACGAGAGG-3') and the API primer. Marathon-Ready 
cDNA from human testis (from Clontech; Catalogue Number 7414-1) was used as the cDNA 
source. The same PCR conditions were selected as in the 1st PCT in RACE round 1. The 1st 
PGR was also followed, in RACE round 2, by a 2nd "nested" PCR using diluted PCR product 
as the cDNA source. The gene-specific primer GSP6 from the 5' region of RACE product 1 
(5'-GGCACACTCGGCAGGAAACGCACATGG-3') and the AP2 primer were used as the 
"nested" PCR primers. The conditions corresponded to parameters for the nested PCR from 
RACE round L 
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The PCR product of 412 bp in length from the nested PGR of RACE round 2 was cloned into 
the TA cloning vector pCRII-Topo from Invitrogen and sequence completely (Fig. 9 and SEQ 
ID No. 4). The sequence segment from bp 267 to bp 412 is completely homologous with the 
5' region of the product from RACE L The region from bp 1 to bp 266 extends RACE 
product 1 at the 5' end. This RACE product 2 is probably, in its entirety, an intron region of 
the hTC gene (data not shown). 

RACE round 3: 

A third RACE round led to the identification of hTC-cDNA regions which were located 
further on in the 5' direction. Using the sequence results from RACE round 2 as a base, a 
gene-specific primer GSP9 (5'-CCTCCTCTGTTCACTGCTCTGGCC-3') was selected from 
the 5' region of RACE product 2 and used in a new RACE together with the API primer and 
Marathon-Ready cDNA from human testis (from Clontech). The RACE conditions were the 
same as those used in the 1st PCR in RACEs 1 and 2. In the "nested" RACE which followed, 
and which took place, in accordance with the "nested" RACEs in rounds 1 and 2, using the 
gene-specific primer GSPIO from the 5' region of RACE product 2 
(5'-CGTAAGTTTATGCAAACTGGACAGG-3') and AP2, a fragment of 1012 bp in length 
(Fig. 10 and SEQ ID No. 5) was amplified and cloned into the TA cloning vector pCRH- 
-TOPO. Subsequent sequencing showed that the 3* region of this RACE fragment (bp 817 - 
bp 1012) evidently still constitutes an intron sequence of the hTC gene. The region from 
bp 889 to bp 1012 is completely homologous with the 5' region of RACE product 2. On the 
other hand, the 5* region of this fragment, from bp 1 to bp 816, is identical to the bp 814 - 
bp 1629 region of the hTC-cDNA which is shown in Fig. 1. A potential 5' splice consensus 
sequence is located at the exon-intron transition. 

Example 7 

A PCR was carried out in order to clone a contiguous fragment from the sequence 
information obtained from RACE 2 and clone AA28 1296. Marathon-Ready cDNA from 
human testis (from Clontech; Catalogue Number 7414-1) was used as the cDNA source. The 
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PCR mixture was as described under RACE 1 (compare Example 6) but using the primers 
C5F (5'-CGAGTGGACACGGTGATCTCTGCC-3') from the 5' region of RACE 2 and 
primer C3B (5'-GCACACCnTGGTCACTCCAAATTCC-3') from a 3' region of clone 
AA281296. The PCR was carried out in 2 steps. After a one-minute denaturation at 94°C, 
denaturation was then carried out for 36 cycles of 30 sec at 94°C and, after that, the primers 
were annealed, and the DNA chain was extended, for 4 min at 68°C. 

A DNA fragment of 2486 bp in length, which is designated the C5F fragment below, was 
obtained as the product of this PCR. This fragment was cloned into the TA cloning vector 
pCRH-TOPO from Invitrogen and subjected to complete double-strand sequencing. A 
comparison of the DNA sequences of the C5F fragment and the AA281296 clone showed 
that there was an in-frame insertion of 182 bp between RT motif 3 and RT motif 4 (Positions 
2352 to 2533, Fig. 1). A further comparison of DNA of the C5F fragment with the sequences 
from the three RACE rounds made it clear that an intron which was already identified in 
RACE 2 was present at the 3' end of C5F. A 3' splice consensus sequence is located at the 
exon-intron transition. In summary, the DNA sequence of the C5F fragment is consequently 
composed of the sequence information shown in Fig. 9 (Position 64 to 278) and the sequence 
data shown in Fig. 1 (Positions 1636 to 3908). 

Example 8 

For cloning the 5' end of the hTC-cDNA, a homology screening (Ausubel et al, 1987) was 
carried out in addition to the RACE protocol described in Example 6. A human 
erythroleukaemia 5 -stretch plus cDNA library (from Clontech, cat. No. HL5016b) from the 
human leukaemia cell line K562 was used as the cDNA source. Approximately 3x10^ Pfu of 
this random and oligo-dT-primed library were plated out and used for screening as described 
in Ausubel etal, (1987). A radioactively labelled hTC-DNA fragment of 719 bp in length 
(Positions 1685 to 2404, corresponding to Fig. 1) was used as the probe. 

Following a rescreening with the same hTC probe, the X clone 12 was verified as being 
positive out of 20 putatively positive X clones. Following plaque purification and X DNA 
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preparation (Ausubel etaL, 1987), the 4kb insert was recloned into the pBluescript vector 
and sequenced (Fig. 1 1 and SEQ ID No. 6). 

A comparison of the X clone 12 sequence with the sequences of the RACE clones and the 
DNA sequence of clone AA281296 showed that this clone, which was identified in the 
homology screening, encodes a 5' part of the hTC-cDNA and possesses a putative ATG start 
codon in Position 63 in accordance with Fig. 1. There is no stop codon in the same reading 
frame 5' of this ATG. Subsequent sequence analyses make it clear that X clone 12 probably 
contains an intron from Positions 1656 to 2004. Very well conserved 5' and 3* splice sites 
provide support for this hypothesis. The hTC-cDNA-encoding sequence then continues from 
Position 2005 to Position 2382. The sequence from 2383 to the 3' end of X clone 12 exhibits a 
conspicuous open reading frame in reading frame -4. A bioinformatic analysis of the 
corresponding DNA sequence showed that, over about 400 bp, this reading frame is identical 
to a variety of ESTs which have no connection with the hTC cDNA. Consequently, 
X clone 12 is a chimeric clone which essentially consists of the 5' end of the hTC cDNA and 
another cDNA clone of unknown function. 

A diagrammatic summary showing the relative orientations of the RACE products, and the 
homology screening, is depicted in Fig. 12. The complete sequence of the hTC cDNA (Fig. 1) 
was assembled from X clone 12 (Positions 21 to 1655 in accordance with Fig. 11), the C5F 
PCR product (Positions 1636 to 3908 in accordance with Fig. 1) and EST AA28 1296 
(Positions 3909 to 4042, in accordance with Fig. 1). 

Example 9 

A total of seven motifs for reverse transcriptases (RT motifs) was identified by comparing the 
phTG protein sequence (Fig. 2 and SEQ ID NO. 2) with a reverse transcriptase consensus 
sequence (Poch et aL, 1989, Xiong and Eickbush, 1990) (Fig. 13). Within these motifs, some 
aniino acids are highly conserved not only between the RT consensus sequence and phTC but 
also in comparison with the Euplotes telomerase protein. Thus, two aspartic acids (Positions 
868 and 869 in Fig. 2) are, for example, completely conserved in RT motif 5 (Fig. 13). RT 
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motif 7, which was deduced from other reverse transcriptases (Poch et al,, 1989, Xiong and 
Eickbush, 1990), was only demonstrated in the human catalytic telomerase subunit and not in 
the Euplotes protein (Fig. 13). 

Structural features which can only be found in the telomerase proteins and not in other 
reverse transcriptases are also conspicuous. The telomerase motif (Positions 553 and 565 in 
Fig. 2) is a structure which is specific for this protein family since it does not occur in any 
previously known protein. A further feature which has only been identified in the catalytic 
telomerase proteins is the difference between RT motifs 3 and 4, which distance, at 
107 amino acids, is markedly greater than in other RTs. These special features indicate that 
the catalytic subunits of the telomerases from different species probably constitute a separate 
subgroup of RNA-dependent DNA polymerases. 

Example 10 

Expression of the telomerase RNA subunit (hTR) does not correlate with telomerase activity 
but, instead, is observed ubiquitously (Feng et ah, 1995). Consequently, the question arises as 
to whether expression of the catalytic telomerase subunit is associated with telomerase 
activity. 

Northern blot experiments (Ausubel et al, 1987) were carried out in order to analyze the 
level of hTC expression. The commercially available Northern blots were supplied with a 
number of RNA preparations from normal human tissue (from Clontech; catalogue 
No. 7760-1) or with RNA samples from human cancer cell lines (from Clontech; Catalogue 
Number 7757-1). A radioactively labelled hTC DNA fragment of 719 bp in length (Positions 
1685 to 2404, in accordance with Fig. 1) was used as the probe. The membranes were 
incubated with the probe in accordance with the manufacturer's (Clontech) instructions. 

Two main RNA transcripts, of about 9.5 kb and 4.4 kb in size, and an additional RNA 
transcript of about 6 kb, which transcripts cross-hybridize with the probe, were detected in the 
eight human cell lines (3 leukaemia cell lines, 3 carcinoma cell lines, one melanoma and one 
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lymphoma) tested (Fig. 15, Fig. A). In the comparison, the hTC mRNA was expressed most 
strongly in the leukaemia cell lines K-562 and HL-60 (Fig. 15, Fig. A). By contrast, it was not 
possible to detect the hTC transcript in the normal tissues (heart, brain, placenta, lung, liver, 
skeletal muscle, kidney and pancreas) which were tested (Fig. 15, Fig. B). This observation is 
not surprising since it was not possible to detect any telomerase activity, either, in these 
tissues (Kim etai, 1994). 

These data indicate that the induction of hTC expression plays an important role in activating 
the telomerase during tumour development. 

Example 11 

Several PCR products, whose sizes only differed from each other to a minimal extent, were 
always obtained when the hTC cDNA fragments from various cDNA libraries (Clontech 
Marathon Ready cDNA from the human leukaemia cell line K562 and from human testis and 
also cDNA from the human premyeloid leukaemia cell line HL60) were subjected to PCR 
amplification. In order to elucidate the differences between the different hTC-PCR products, 
a fragment of the hTC cDNA depicted in Fig. 1 extending from bp 1783 to bp 3901 >Avas 



amplified using the primers C5A (5'-CCGGAAGAGIGTCTGGAGCAAGTTGC-3') and 



C3B (5'-GCACACCTTTGGTCACTCCAAATTCC-3*). Marathon-Ready cDNA from K562 
leukaemia cells (from Clontech; Catalogue Number 7441-1) was used as the cDNA source 
(PCRl and 2). In a third PCR, a hTC fragment, from bp 1695 to bp 3463, of the hTC cDNA 
in Fig. 1 was amplified from HL60 cDNA using the primers GSPl front 





(5'-GGCTGATGAGTGTGTACGTCGTCGAG-3') 
(5-GGGTGGCCATCAGTCCAGGATGG-3'). 



and 



HTRT3A 



The conditions of the 3 PCR reactions are described below: 



In the first PCR, and in a final volume of 50 |al, 10 pmol of dNTP mix were added to 5 |il of 
K562 Marathon-Ready cDNA, and a PCR reaction was carried out in 1 x Klen Taq PCR 
reaction buffer and 1 x Advantage Klen Taq polymerase mix (from Clontech). 10 pmol of 
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each of the primers C5A and C5B were added. The PGR was carried out in 3 steps. A 
one-minute denaturation at 94°C was followed by 35 PGR cycles in which the DNA was 
firstly denatured for 30 sec at 94°C and the primers were then annealed, and the DNA chain 
was extended, for 4 min at 68°G. In conclusion, there followed a chain extension for 10 min 
at The resulting PGR products were cloned into the TA cloning vector pGRQ-TOPO 
from Invitrogen. 

In a second PGR, 10 pmol of each of the primers C5A and G3B, 10 pmol of dNTP mix and 

2 U of Taq DNA polymerase (from Gibco-BRL) were added to 5 [xl of K562 Marathon- 
Ready cDNA, and a PGR reaction was carried out in 1 x PGR buffer (from Perkin Elmer) in a 
final volume of 50 |aL The PGR reaction was carried out in 3 steps. The DNA was firstly 
denatured for 3 min at 94°G. There then followed 34 cycles in which, consecutively, the 
DNA was denatured for 45 sec at 94°G, primer annealing then took place for 1 min at 68°G 
and, after that, the DNA chain was extended for 3 min at 72°G. In the last PGR step, a 
concluding chain extension was carried out for 10 min at 72°G. The resulting PGR products 
were cloned into the TA cloning vector pGR2.1 from Invitrogen. 

For the third PGR, the cDNA synthesis kit from Boehringer Mannheim was first of all used to 
carry out a cDNA synthesis from 2 ^g of DNasel-treated poly-A RNA from the human 
premyeloid cell line HL60 in accordance with the manufacturer's instructions. 1 \i\ of this 
HL60 cDNA was then mixed with 10 pmol of each of the primers GSPl front and HTRT3A 
and also 10 pmol of dNTP mix, in a final volume of 50 ^1, and, after 1.25 \i\ of DMSO in 

i. 

1 X Klen Taq PGR reaction buffer and 1 x Advantage Klen Taq polymerase mix (from 
Glontech) had been added, a PGR reaction was carried out. The PGR reaction proceeded in 

3 steps. After a denaturation for 3 min at 94°G, the DNA was initially denatured for 1 min at 
94°G and the primers were then annealed, and the DNA chain extended, for 4 min at 68°G, 
over 37 cycles. The reaction was concluded by a further incubation for 10 min at 68°G. The 
PGR products were cloned into the TA cloning vector pGR2.1-T0P0. 
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Complete double-strand sequencing of the cloned hTC cDNA fragments from PCRs 1 and 2, 
and partial sequencing of the hTC cDNA fragments obtained from PCR 3, showed that, in 
addition to the hTC cDNA depicted in Fig. 1, 4 variants of this cDNA exist in human cells, 
i.e.: 

Variant 1 of human hTC cDNA is distinguished by a deletion of 182 bp in length extending 
from nucleotides 2345 to 2526. This deletion results in the ORF being displaced, with a 
truncated hTC protein, which lacks RT motifs 4 to 7, being read off. 

Variant 2 of human hTC cDNA exhibits a deletion of 36 bp in length extending from 
nucleotides 2184 to 2219. RT motif 3 is lost as a result of this deletion. However, the reading 
frame is retained and a protein is produced which selectively lacks RT motif 3. 

Variant 3 of human hTC cDNA is a combination of variants 1 and 2. It exhibits both a 
deletion from bp 2184 to 2219 and a deletion from bp 2345 to 2526. 

Variant 4 of human hTC cDNA is distinguished by the loss of the nucleotide region from 
bp 3219 to bp 3842. This missing sequence is replaced by a sequence which is not 
homologous with hTC. From bp 3843 onwards, the sequence is once again completely 
identical to the hTC sequence depicted in Fig. 1. The sequence of variant 4 is shown in 
Fig. 14. In accordance with the 5' primer chosen, it begins with bp 1783 of the hTC cDNA 
shown in Fig. 1. The region which is not homologous is emphasized in bold and, from 
Position 3219 to Position 3451 (Fig. 14 and SEQIDNo.7) is, to the extent of 98.7%, in 
agreement, at the DNA level, with an EST (Accession No. AA299878) from a human uterus 
tumour. 

Exanple 12 

In order to obtain antisera having specificity for the catalytic subunit of human telomerase, 
the available nucleotide sequence (Fig. 1) was translated into an amino acid sequence 
(Fig. 2). Using a secondary structure prediction program (PROTEAN, from the DNAStar 
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software package. DNASTAR Inc., Madison, WI, USA), two peptides we. chosen w.c^^^ 
1 a ceL de^e of p— ty, evoKe an — e .sponse. These a. the Mowing 
peptides, which are depicted in the one-letter code for an^no aads: 

B- C-K-R-V-Q-L-R-E-L-S-E-A-E-V-R-Q - CONH,/Pos. 594 - 608 

C C-Q-E-T-S-P-L-R-D-A-V-V-I-E-Q-S-S-S-L-N-E - CONH./Pos. 781-800 

A lin^H are not derived from the telomerase sequence but were 
The cysteines which are underlined are not aenveu 

additionally added on as linkers for the coupling. 

tn kevhole limpet hemocyanin (KLH) using the thiol-reactive 
The neptides were coupled to keytioie imipci j 

J . each case — w„h ^ coupled pepUdes a, in,e.als of 2jo 4 w^. 
.„ —0. 5 n,, of b,c^ were w,md«w„ in .0 ob» p,e— - 
, 1 4 i™.u.za.o„s, 5 of bloC we„ «.ew,» wUhOrawn f« ob,a.™„, — - 
Tbl se„ we. .esfea fo, ,eac.v,. wi* f.s.on p».e,ns (Exa.p,e ,3, in a Wes.™ b,o„,n. 

experiment (Ausubel a/., 1987). 

Example 13 

■ nPrinnents were carried out in order to be able to analyse the protein of 
Bacterial expression experiments were camcu 

the catalytic telomerase subunit. 



25 



30 



The constructs of these experiments are described below: 

Por the expression construct pMalEST, the inseri in the AA281296 clone mentioned m 
Example2wasexcisedwithrestncrionenzymesEcoRIandNotIandthecleavage^.^^^^^^^^ 

ednuslngtheKlenowfragment(AusubeU.^,.87);theinsertwasthenc^ 

. .f the maltose-binding protein of the bacterial expression vector 
given reading frame of the maltose oinams v ^^nction 
. XT p ai.nd Biolabs) Vector pMAL-C2 was digested with restriction 
pMAL-C2 (from New England Bioiaosj. vcuiui v 
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H- .inde strand ends were removed with T4 DNA polymerase 
enzyme Pst 1 and the protruding single-strana 



(Ausubel et al, 1987). 



contains the nucleotide sequence of Fig. 1 from 
The expression construct pMalM contains commercially 

. ■ onnQ Thi<i DNA fragment was amphtiea irom <x 
P„U,o. ,789 .0 Pos,.,on390 . ^-^^ ^^^^^^ „ ,„„,„,44,..) 

— of PCR "J,;,,,„cC-^^ and doned ,n,o the TA Coning 

and C3B ,,-GCACACCTrrGOTCACTCCA ^ ^^^^ _^ ^^^^^^ 

PCRH-TOPO ^^-^^ 1 „.Hc,i™ enz^e 

P„ .e e.p«ssion — pMalAl ,,„,„^, 

:r— ------ 

p.. _ - - :r ::::tr — ^^^^ 

«p«s«,„n cond«ons « t^acen. „sa,e. wMcH prepaid 

New England BWabs (Catalogue Number 800). The ^ 
„e« «ed in a Western Hotting exp^nment (Ausube, e,al.. 1987). 




> 



. n wei^ analysed in a Western blot (Ausubel et ai, 1987) 

The bacterial lysates from Example 13 were anaiy 

using the antisera from Example 12. 

. . on of the fusion represented by the maltose-binding protein is about 43 ld.a 
Sincetheproportionof thef. ^ ^^^^^^^ ^^,,3, 

in size, fusion proteins of about 74 kua 

pMalAl constructs, respectively. 

When contpanng the p— with the se^ 

e*n. that spectne — "-^^^ Z^^. 
Furthermore, in addition to the expected 74 kDa and 
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.Uo observed which react with the antisera. These smaller products 
protein fragments were also observeo wi 

probably originate from premature products. 

R is oresent on the fusion protein from the expression using 
only the epitope for serum B . prese^ ^^^^^ ^^^^^ ^^^^ 

PM.HST. By contrast, the eptopes --2lTJoJ.^ pM^BST expression prCuct 
pMalM. For th. reason, antiserum C dc.s ^ ^^^^^^^^ 

^^^^^"^ " ":ri"r^eeofspectn.tyoftheant,^ 
pMal Al . This observation underlmes tne mgn 
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generated. 



ExamplelS 



30 



, th. nrotein of the catalytic telomerase subunit. the protein 
Tn order to be able to analyse the protem oim j 

• frn tncether with the RNA component, 
component should be reconstituted m vitro together w,t 

The constructs for these expenments at. described below: 

. f S04 nt in length (Feng et al, 1995) was amplified from a 293 cell 
The RNA component of 504 ntmlengt g ^^^^^ 

cDNA library usmg ^^.r^r'rTrrnCCTG-3') and HrR2BAM 

volume of 100 ^. .0 3 ^1 o abco). 10 pmol of each 

, . PCR taction buffer con..n,ng 0.5 Ml of Ta, p j™ ^ 

. muORAM and HTR2BAM were added. Tie PCR was cam 

of d,e pnn^rs HrR9BAM a d ^ ^^^^^ „^ 

. — " ' - ^, „ ^„c. were annealed, and 

of an denatured for one ^ .e .9 ^ 

a,e DNA chain was exKnded, for 2 nun a. 

■ r „,i„ a. 72-C The resulUng PCR praducis were cloned, after 
extension for 4 min at 1" ...noinsuchawaythatthe 
„^.tion with Ba. HI, tnto the Batn H. Ceava^e site of vector pUC19 in 
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RNA component is under the control of the T7 promoter. This construct is designated 
HTR504 in that which follows. 

The cDNA fragment of 3411 bp in length (Position 60 to Position 3470, Fig. 1) was cloned 
into the vector PCRII TOPO (from Invitrogen). Detailed information on the cloning is given 
in Examples 8 and 7, and also in Fig. 12. In this construct, which is designated HTC FL, the 
T7 promoter is located 5' before the hTC cDNA. 

The catalytic telomerase protein component was synthesized in a commercially available 
transcription/translation system, after adding the hTC FL construct, in accordance with the 
manufacturer's (Promega; Catalogue Number L4610) instructions. Whether the in vitro 
translation of the expected 127 kDa product had been successful was checked in an 
SDS-PAGE (Ausubel et al, 1987) using ^^S-labelled cysteine (Fig. 17). 

The telomerase RNA component was synthesized using a transcription system in accordance 
with the manufacturer's (Ambion; Catalogue Number 1344) instructions or using the method 
described by Pokrovskaya and Gurevich (1994). 

For the in vitro re-constitution, 0.5 jig of hTRNA was added to 50 |il of the above-described 
translation mixture containing the hTC FL construct and the whole was incubated at 3VC for 
10 min. The enzymatic activity of 2 |il of this mixture was investigated using the TRAP assay 
(N.W. Kim et al., 1994). The measurement of the activity, by the same method, of telomerase 
which was purified from HeLa cells (Shay etal.y 1994) was used as the positive control. As 
can be seen in Fig. 18, both the reconstituted enzyme and the native enzyme produce the 
same product pattern, i.e. the nucleotide ladder which is characteristic for telomerase. This 
result also verifies that a single protein component, together with the RNA, is sufficient for 
the enzymatic telomerase activity. 

In addition to the described TRAP assay, 5 |il of the reconstitution mixture were tested for its 
activity in a direct telomerase assay (Shay era/., 1994). In this experiment, too, the 
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characteristic nucleotide ladder verifies the successful reconstitution of recombinant hTC 
protein and telomerase RNA component. 

In summary, it was hereby possible to demonstrate, at the functional level, that the identified, 
and completely cloned, hTC-cDNA constitutes the catalytic subunit of human telomerase. 
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its diagnostic andVherapeutic use 

(iii) NUMBER OF SEQUENCES: V 

(iv) COMPUTER-READABLE FORm\ 

(A) MEDIUM TYPE: Flopp\ disk 

(B) COMPUTER: IBM PC coftroatible 

(C) OPERATING SYSTEM: PC Y>OS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.30B (EPA) 



(2) INFORMATION FOR SEQ ID NO: 1: \r> 

(i) SEQUENCE CHARACTERISTICS: \ 

(A) LENGTH: 4042 Basenpaare \ 

(B) TYPE: Nucleotide 

(C) STRANDEDNESS : Einzelstrang 

(D) TOPOLOGY: Linear 

(ii) ART DES MOLEKsLS : cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL/ ISOLATE: Human 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: \ 

GTTTCAGGCA GCGCTGCGTC CTGCTGCGCA CGTGGGAAGC CCTGGcVcCG GCCACCCCCG 60 

CGATGCCGCG CGCTCCCCGC TGCCGAGCCG TGCGCTCCCT GCTGCGCAGC CACTACCGCG 120 

AGGTGCTGCC GCTGGCCACG TTCGTGCGGC GCCTGGGGCC CCAGGGCTOG CGGCTGGTGC 180 

AGCGCGGGGA CCCGGCGGCT TTCCGCGCGC TGGTGGCCCA GTGCCTGGTgX TGCGTGCCCT 240 

GGGACGCACG GCCGCCCCCC GCCGCCCCCT CCTTCCGCCA GGTGTCCTGC STGAAGGAGC 300 

TGGTGGCCCG AGTGCTGCAG AGGCTGTGCG AGCGCGGCGC GAAGAACGTG CKSGCCTTCG 360 

GCTTCGCGCT GCTGGACGGG GCCCGCGGGG GCCCCCCCGA GGCCTTCACC ACOAGCGTGC 420 

GCAGCTACCT GCCCAACACG GTGACCGACG CACTGCGGGG GAGCGGGGCG TGGQGGCTGC 480 



Le A 32 486-Foreign c^ ^ries 

-46- 

TGCTGCGCCG CGTGGGC«AC GACGTGCTGG TTCACCTGCT GGCACGCTGC GCGCTCTTTG 540 

TGCTGGTGGC TCCCAGCTCC GCCTACCAGG TGTGCGGGCC GCCGCTGTAC CAGCTCGGCG 600 

CTGCCACTCA GGCCCGGCOC CCGCCACACG CTAGTGGACC CCGAAGGCGT CTGGGATGCG 660 

AACGGGCCTG GAACCATAGQ GTCAGGGAGG CCGGGGTCCC CCTGGGCCTG CCAGCCCCGG 720 

GTGCGAGGAG GCGCGGGGGC VgTGCCAGCC GAAGTCTGCC GTTGCCCAAG AGGCCCAGGC 780 

GTGGCGCTGC CCCTGAGCCG gVgCGGACGC CCGTTGGGCA GGGGTCCTGG GCCCACCCGG 840 

GCAGGACGCG TGGACCGAGT GASCGTGGTT TCTGTGTGGT GTCACCTGCC AGACCCGCCG 900 

AAGAAGCCAC CTCTTTGGAG GGTGCGCTCT CTGGCACGCG CCACTCCCAC CCATCCGTGG 960 

GCCGCCAGCA CCACGCGGGC CCCCCATCCA CATCGCGGCC ACCACGTCCC TGGGACACGC 1020 

CTTGTCCCCC GGTGTACGCC GAGACCAAGC ACTTCCTCTA CTCCTCAGGC GACAAGGAGC 1080 

AGCTGCGGCC CTCCTTCCTA CTCAGOTCTC TGAGGCCCAG CCTGACTGGC GCTCGGAGGC 1140 

TCGTGGAGAC CATCTTTCTG GCTTCCyJEGC CCTGGATGCC AGGGACTCCC CGCAGGTTGC 1200 

CCCGCCTGCC CCAGCGCTAC TGGCAAAmSC GGCCCCTGTT TCTGGAGCTG CTTGGGAACC 1260 

ACGCGCAGTG CCCCTACGGG GTGCTCCTCA AGACGCACTG CCCGCTGCGA GCTGCGGTCA 1320 

CCCCAGCAGC CGGTGTCTGT GCCCGGGAck AGCCCCAGGG CTCTGTGGCG GCCCCCGAGG 1380 

AGGAGGACAC AGACCCCCGT CGCCTGGTGQ AGCTGCTCCG CCAGCACAGC AGCCCCTGGC 1440 

AGGTGTACGG CTTCGTGCGG GCCTGCCTGC \gCCGGCTGGT GCCCCCAGGC CTCTGGGGCT 1500 

CCAGGCACAA CGAACGCCGC TTCCTCAGGA AcaCCAAGAA GTTCATCTCC CTGGGGAAGC 1560 

ATGCCAAGCT CTCGCTGCAG GAGCTGACGT GG^GATGAG CGTGCGGGAC TGCGCTTGGC 1620 

TGCGCAGGAG CCCAGGGGTT GGCTGTGTTC CGaCCGCAGA GCACCGTCTG CGTGAGGAGA 1680 

TCCTGGCCAA GTTCCTGCAC TGGCTGATGA GTGTGTACGT CGTCGAGCTG CTCAGGTCTT 1740 

TCTTTTATGT CACGGAGACC ACGTTTCAAA AGAACAGGCT CTTTTTCTAC CGGAAGAGTG 1800 

TCTGGAGCAA GTTGCAAAGC ATTGGAATCA GACAaCACTT GAAGAGGGTG CAGCTGCGGG 1860 

AGCTGTCGGA AGCAGAGGTC AGGCAGCATC GGGAAGCCAG GCCCGCCCTG CTGACGTCCA 1920 

GACTCCGCTT CATCCCCAAG CCTGACGGGC TGCGGQCGAT TGTGAACATG GACTACGTCG 1980 

TGGGAGCCAG AACGTTCCGC AGAGAAAAGA GGGCCGAGCG TCTCACCTCG AGGGTGAAGG 2040 

CACTGTTCAG CGTGCTCAAC TACGAGCGGG CGCGGCGCCC CGGCCTCCTG GGCGCCTCTG 2100 

TGCTGGGCCT GGACGATATC CACAGGGCCT GGCGCACQTT CGTGCTGCGT GTGCGGGCCC 2160 

AGGACCCGCC GCCTGAGCTG TACTTTGTCA AGGTGGATGT GACGGGCGCG TACGACACCA 2220 

TCCCCCAGGA CAGGCTCACG GAGGTCATCG CCAGCATCAT CAAACCCCAG AACACGTACT 2280 

GCGTGCGTCG GTATGCCGTG GTCCAGAAGG CCGCCCATGQ GCACGTCCGC AAGGCCTTCA 2340 
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.O^OCCCOT CTCTAcLo «TCC AGCCOTAC.T OTOGCTCACC 
.OC«C C^GCCCC^O AOCOATGCCO TCOTCATCC GCACAGCTCC TCCCTC^^TG 
.OGCCOCAO TGCCCTCTtX G.COTCTTCC T.COCTTC.T GTGCCACCAC GCCGTGCGC. 
TCAGGGGC^A OTCCTACGTcVgTGCCAGG GGATCCCGC. GGGCTCCATC CTCTCCACGC 
TGCTCTGCAG CCTGTGCTAC \gCGACATGG AGAACA.GCT GTTTGCGGGG ATTCGGCGGG 
.CGGGCTGCT CCTGCGTTTG g\gGATGATT TCTTGTTGGT G.CACCTCAC CTCACCCACG 
CCAAAACCTT CCTCAGGACC CtWcGAG GTGTCCCTGA GTATGGCTGC GTGGTGAACT 
.OCGGAAGAC AGTGGTGAAC TTcVcTGTAG AAGACGAGGC CCTGGGTGCC ACGGCTTTTG 
TTCAGATGCC GGCCCACGGC CTA^CCCCT GGTGCGGCCT GCTGCTGGAT ACCCGGACCC 
.OGAGGTGCA GAGCGACTAC TCCaLaTG CCCGGACCTC CATCAGAGCC AGTCTCACCT 

^\,^o,v mrrcTrrCJ^ ACTCTTTGGG GTCTTGCGGC 
TCAACCGCGG CTTCAAGGCT GGGAG(^CA TGCGTCGCAA ACTLi 

TOAAGTOTCA CAGCCTGTTT CTGGA^C AGGTGAACAG CCTCCAGACG GTGTGCACCA 
ACATCTACAA GATCCTCCTG CTGCAgI^GT ACAGGTT.CA CGCATGTGTG CTGCAGCTCC 
CATTTCATCA GCAAGTTTGG AAGAACCck CATTTTTCCT GCGCGTCATC TCTGACACGG 
CCTCCCTCTG CTACTCCATC CTGAAAGcA AGAACGCAGG GATGTCGCTG GGGGCCAAGG 
OCGCCGCCGG CCCTCTGCCC TCCGAGGCCgWaGTGGCT GTGCCACCAA GCATTCCTGC 

\rrrB,rTCCT GGGGTCACTC AGGACAGCCC 
TCAAGCTGAC TCGACACCGT GTCACCTACG toCACTCCT i.^^^ 

AOACGCAGCT GAGTCGGAAG CTCCCGGGGA ckcGCTGAC TGCCCTGGAG GCCGCAGCCA 
.CCCGGCACT GCCCTCAGAC TTCAAGACCA TC^GGACTG ATGGCCACCC GCCCACAGCC 
.CGCCGAGAG CAGACACCAG CAGCCCTGTC ACG CTACGTCCCA GGGAGGGAGG 

OGCGGCCCAC ACCCAGGCCC GCACCGCTGG GAGT^GAGG CCTGAGTGAG TGTTTGGCCG 
.OGCCTGCAT GTCCGGCTGA AGGCTGAGTG TCCGg\tGAG GCCTGAGCGA GTGTCCAGCC 

^o,^ ^rrTrrCCTC TTCACt\cCC CACAGGCTGG CGCTCGGCTC 
AAGGGCTGAG TGTCCAGCAC ACCTGCCGTC 

r-norrrrVTr CACTCCCCAC ATAGGAATAG 
CACCCCAGGG CCAGCTTTTC CTCACCAGGA GCCCGGCTTC CACIt.c^^ 

TCCATCCCCA GATTCGCCAT TGTTCACCCC TCGCCCTgW CTCCTTTGCC TTCCACCCCC 
.CCATCCAGG TGGAGACCCT GAGAAGGACC CTGGGAGc4c TGGGAATTTG GAGTGACCAA 
.OGTGTGCCC TGTACACAGG CGAGGACCCT GCACCTGGaV GGGGGTCCCT GTGGGTCAAA 
.TGGGGGGAG GTGCTGTGGG AGTAAAATAC TGAATATATg\aGTTTTTCAG TTTTGAAAAA 

^^^^j^AAAAAAA AAAAAAAAAA AA 
(2) INFORMATION FOR SEQ ID NO: 2: 
(i) SEQUENCE CHARACTERISTICS: 



2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4042 
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(A) LENGTHS 1132 amino acids 

(B) TYPE: Apiino acid 

(C) STRANDeWeSS : Single 

(D) TOPOLOG^ Linear 

(ii) ART DES MOLEKgLS: Protein 

(iii) HYPOTHETICAL: NC 

(iv) ANTI SENSE: NO 

(vi) ORIGINAL SOURCE 

(C) INDIVIDUAL/ IS\)LATE: Hiiman 



(xi) SEQUENCE DESCRIPTION^ SEQ ID NO: 2: 

Met Pro Arg Ala Pro Arg cVs Arg Ala Val Arg Ser Leu Leu Arg Ser 
1 ' 5 \ 10 15 



His Tyr Arg Glu Val Leu Prc\ Leu Ala Thr Phe Val Arg Arg Leu Gly^ 



20 



25 



30 



Pro Gin Gly Trp Arg Leu Val 
35 



Arg Gly Asp Pro Ala Ala Phe Arg 

45 



Ala Leu Val Ala Gin Cys Leu VaU Cys Val Pro Trp Asp Ala Arg Pro 
50 55 \ 60 

Pro Pro Ala Ala Pro Ser Phe ArgXGln Val Ser Cys Leu Lys Glu Leu 
65 70 \ 75 80 

Val Ala Arg Val Leu Gin Arg Leu cVs Glu Arg Gly Ala Lys Asn Val 

85 \ 90 95 

Leu Ala Phe Gly Phe Ala Leu Leu AsA Gly Ala Arg Gly Gly Pro Pro 

100 105\ 110 

Glu Ala Phe Thr Thr Ser Val Arg Ser ¥yr Leu Pro Asn Thr Val Thr 
115 120 \ 125 

Asp Ala Leu Arg Gly Ser Gly Ala Trp GlV Leu Leu Leu Arg Arg Val 
130 135 \ 140 

Gly Asp Asp Val Leu Val His Leu Leu AlaV^rg Cys Ala Leu Phe Val 
145 150 155 160 

Leu Val Ala Pro Ser Cys Ala Tyr Gin Val cVs Gly Pro Pro Leu Tyr 

165 170 \ 175 

Gin Leu Gly Ala Ala Thr Gin Ala Arg Pro PrA Pro His Ala Ser Gly 

180 185 \ 190 

Pro Arg Arg Arg Leu Gly Cys Glu Arg Ala Trp ksn His Ser Val Arg 
195 200 \ 205 

Glu Ala Gly Val Pro Leu Gly Leu Pro Ala Pro Gly Ala Arg Arg Arg 
210 215 22 



Gly Gly Ser Ala Ser Arg Ser Leu Pro Leu Pro Lys v^rg Pro Arg Arg 
225 230 235 \ 240 
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Gly Ala Ala\Pro Glu Pro Glu Arg Thr Pro Val Gly Gin Gly Ser Trp 

245 250 255 

Ala His Pro Qly Arg Thr Arg Gly Pro Ser Asp Arg Gly Phe Cys Val 

2V0 265 270 

Val Ser Pro Alb Arg Pro Ala Glu Glu Ala Thr Ser Leu Glu Gly Ala 
275 \ 280 285 

Leu Ser Gly ThrVArg His Ser His Pro Ser Val Gly Arg Gin His His 
290 \ 295 300 

Ala Gly Pro Pro S^r Thr Ser Arg Pro Pro Arg Pro Trp Asp Thr Pro 
305 \ 310 315 320 

Cys Pro Pro Val TyA Ala Glu Thr Lys His Phe Leu Tyr Ser Ser Gly 

325\ 330 335 



Asp Lys Glu Gin Leu 

340 



^rg Pro Ser Phe Leu Leu Ser Ser Leu Arg Pro 

345 350 



Ser Leu Thr Gly Ala Arb Arg Leu Val Glu Thr lie Phe Leu Gly Ser 

355 \ 360 365 

Arg Pro Trp Met Pro Gly Vhr Pro Arg Arg Leu Pro Arg Leu Pro Gin 
370 ' 380 



Arg Tyr Trp Gin Met Arg 
385 390 



Leu Phe Leu Glu Leu Leu Gly Asn His 

395 400 



Ala Gin Cys Pro Tyr Gly Val 

• 405 



^Leu Leu Lys Thr His Cys Pro Leu Arg 
410 415 



Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu Lys Pro Gin 

420 \ 425 430 

Gly Ser Val Ala Ala Pro Glu G1u\g1u Asp Thr Asp Pro Arg Arg Leu 
435 440 \ 445 

Val Gin Leu Leu Arg Gin His Ser shr Pro Trp Gin Val Tyr Gly Phe 
450 455 \ 460 

Val Arg Ala Cys Leu Arg Arg Leu Val\pro Pro Gly Leu Trp Gly Ser 
465 470 \ 475 480 

Arg His Asn Glu Arg Arg Phe Leu Arg A^n Thr Lys Lys Phe lie Ser 

485 49)P 495 



Leu Gly Lys His Ala Lys Leu Ser Leu 

500 505 



GlmGlu Leu Thr Trp Lys Met 

510 



Ser Val Arg Asp Cys Ala Trp Leu Arg Arg ser Pro Gly Val Gly Cys 
515 520 \ 525 

Val Pro Ala Ala Glu His Arg Leu Arg Glu GlA lie Leu Ala Lys Phe 
530 535 \540 

Leu His Trp Leu Met Ser Val Tyr Val Val Glu aeu Leu Arg Ser Phe 
545 550 555 \ 560 



Phe Tyr Val Thr Glu Thr Thr Phe Gin Lys Asn Arg Leu Phe Phe Tyr 

565 570 \ 575 
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Arg Lys Ser VaA Trp Ser Lys Leu Gin Ser lie Gly lie Arg Gin His 

580\ 585 590 

Leu Lys Arg Val Gin Leu Arg Glu Leu Ser Glu Ala Glu Val Arg Gin 
595 \ 600 605 

His Arg Glu Ala Arg\Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe lie 
610 \ 615 620 

Pro Lys Pro Asp Gly lW Arg Pro lie Val Asn Met Asp Tyr Val Val 
625 6^0 635 640 

Gly Ala Arg Thr Phe ArA Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser 

645 \ 650 655 

Arg Val Lys Ala Leu Phe ^er Val Leu Asn Tyr Glu Arg Ala Arg Arg 

660 \ 665 670 

Pro Gly Leu Leu Gly Ala SeV Val Leu Gly Leu Asp Asp lie His Arg 
675 \ 680 685 

Ala Trp Arg Thr Phe Val Leu\Arg Val Arg Ala Gin Asp Pro Pro Pro 
690 695 \ 700 



Glu Leu Tyr Phe Val Lys Val 
705 710 



Val Thr Gly Ala Tyr Asp Thr lie 
715 720 



Pro Gin Asp Arg Leu Thr Glu Val lie Ala Ser lie lie Lys Pro Gin 

725 \ 730 735 

Asn Thr Tyr Cys Val Arg Arg Tyr ^la Val Val Gin Lys Ala Ala His 

740 ^45 750 

Gly His Val Arg Lys Ala Phe Lys S^ His Val Ser Thr Leu Thr Asp 
755 760 \ 765 

Leu Gin Pro Tyr Met Arg Gin Phe Val Via His Leu Gin Glu Thr Ser 
770 775 \ 780 



Pro Leu Arg Asp Ala Val Val lie Glu C 
785 790 

Ala Ser Ser Gly Leu Phe Asp Val Phe Leu 

805 810 



Ser Ser Ser Leu Asn Glu 
795 800 

^Arg Phe Met Cys His His 

815 



Ala Val Arg lie Arg Gly Lys Ser Tyr Val Giln Cys Gin Gly lie Pro 

820 825 \ 830 

Gin Gly Ser He Leu Ser Thr Leu Leu Cys Set Leu Cys Tyr Gly Asp 
835 840 \ 845 

Met Glu Asn Lys Leu Phe Ala Gly He Arg ArgVAsp Gly Leu Leu Leu 
850 855 B60 

Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro m.s Leu Thr His Ala 
865 870 875 \ 880 

Lys Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pri Glu Tyr Gly Cys 

885 - 890 \ 895 



Val Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro yal Glu Asp Glu 
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900 



905 



910 



Ala Leu Gl\ Gly Thr Ala Phe Val Gin Met Pro Ala His Gly Leu Phe 
915\ 920 925 

Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu Val Gin Ser 
930 \ 935 940 

Asp Tyr Ser SeV Tyr Ala Arg Thr Ser lie Arg Ala Ser Leu Thr Phe 
945 \ 950 955 960 

Asn Arg Gly PheyLys Ala Gly Arg Asn Met Arg Arg Lys Leu Phe Gly 

65 970 975 

Val Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gin Val Asn 

980 \ 985 990 



Ser Leu Gin Thr 
995 



ValVCys Thr Asn lie Tyr Lys lie Leu Leu Leu Gin 

1000 1005 



Ala Tyr Arg Phe His ^Jla Cys Val Leu Gin Leu Pro Phe His Gin Gin 
1010 \yi015 1020 

Val Trp Lys Asn Pro Tl^?Phe Phe Leu Arg Val lie Ser Asp Thr Ala 
1025 102\ 1035 1040 



Ser Leu Cys Tyr Ser lie Leu Lys Ala Lys Asn Ala Gly Met Ser Leu 

1045 \ 1050 1055 

Gly Ala Lys Gly Ala Ala GlV Pro Leu Pro Ser Glu Ala Val Gin Trp 

1060 \ 1065 1070 

Leu Cys His Gin Ala Phe Leu Leu Thr Arg His Arg Val Thr 

1075 \pSQ 1085 

Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gin Thr Gin Leu Ser 
1090 1095 \ 1100 



Arg Lys Leu Pro Gly Thr Thr Leu 
1105 1110 



^Thr Ala Leu Glu Ala Ala Ala Asn 
1115 1120 



Pro Ala Leu Pro Ser Asp Phe Lys Tnr lie Leu Asp 

1125 \ 1130 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1153 base pairs 

(B) TYPE: Nucleotide 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL/ ISOLATE: HumaTl 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GTGCCTGCAG AGACCQGTCT GGTGCACTCT GATTCTCCAC TTGCCTGTTG CATGTCCTCG 60 

TTCCCTTGTT TCTCACC^CC TCTTGGGTTG CCATGTGCGT TTCCTGCCGA GTGTGTGTTG 120 

ATCCTCTCGT TGCCTCCTOG TCACTGGGCA TTTGCTTTTA TTTCTCTTTG CTTAGTGTTA 180 

CCCCCTGATC TTTTTATTcX CGTTGTTTGC TTTTGTTTAT TGAGACAGTC TCACTCTGTC 240 

ACCCAGGCTG GAGTGTAATG >GCACAATCTC GGCTCACTGC AACCTCTGCC TCCTCGGTTC 300 

AAGCAGTTCT CATTCCTCAA OCTCATGAGT AGCTGGGATT ACAGGCGCCC ACCACCACGC 360 

CTGGCTAATT TTTGTATTTT TaVtAGAGAT AGGCTTTCAC CATGTTGGCC AGGCTGGTCT 420 

CAAACTCCTG ACCTCAAGTG ATC^CCCGC CTTGGCCTCC CACAGTGCTG GGATTACAGG 480 

TGCAAGCCAC CGTGCCCGGC ATACOTTGAT CTTTTAAAAT GAAGTCTGAA ACATTGCTAC 540 

CCTTGTCCTG AGCAATAAGA CCCTTACTGT ATTTTAGCTC TGGCCACCCC CCAGCCTGTG 600 

% TGCTGTTTTC CCTGCTGACT TAGTTCTATO TCAGGCATCT TGACACCCCC ACAAGCTAAG 660 

:t=; CATTATTAAT ATTGTTTTCC GTGTTGAgJ^TTTCTTTAGC TTTGCCCCCG CCCTGCTTTT 720 

=P CCTCCTTTGT TCCCCGTCTG TCTTCTGTCtXcaGGCCCGCC GTCTGGGGTC CCCTTCCTTG 780 

TCCTTTGCGT GGTTCTTCTG TCTTGTTATT qCTGGTAAAC CCCAGCTTTA CCTGTGCTGG 840 

CCTCCATGGC ATCTAGCGAC GTCCGGGGAC CTCTGCTTAT GATGCACAGA TGAAGATGTG 900 

GAGACTCACG AGGAGGGCGG TCATCTTGGC CCOTGAGTGT CTGGAGCACC ACGTGGCCAG 960 

Ui CGTTCCTTAG CCAGGGTTGG CTGTGTTCCG GCCgWgaGC ACCGTCTGCG TGAGGAGATC 1020 

CTGGCCAAGT TCCTGCACTG GCTGATGAGT GTGTAQGTCG TCGAGCTGCT CAGGTCTTTC 1080 

:y TTTTATGTCA CGGAGACCAC GTTTCAAAAG AACAGGOTCT TTTTCTACCG GAAGAGTGTC 1140 

TGGAGCAAGT TGC \ 1153 
(2) INFORMATION FOR SEQ ID NO: 4: \ 

(i) SEQUENCE CHARACTERISTICS: \ 

(A) LENGTH: 412 base pairs \ 

(B) TYPE: Nucleotide \ 

(C) STRANDEDNESS : Single \ 

(D) TOPOLOGY: Linear \ 

(ii) MOLECULE TYPE: cDNA \ 

(iii) HYPOTHETICAL: NO \ 

(iv) TU^TISENSE: NO \ 

(vi) ORIGINAL SOURCE: \ 
(C) INDIVIDUAL/ ISOLATE: Human \ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: \ 
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CTCCATCGTC 


ACGTGGGCAC 


ACGTGGCTTT 


TCGCTCAGGA 


60 


CTCTGCCTCT 


GCTCTCCCTC 


CTGTCCAGTT 


TGCATAAACT 


120 


TTTGATGGAC 


ACGCGGTTTC 


CAGGCACCGA 


GGCCAGAGCA 


180 


GCGGCAGTGG 


AGCCGGGTTG 


CCGGCAATGG 


GGAGAAGTGT 


240 


GCGAGGGTGC 


CTGCAGAGAC 


CCGCCTGGTG 


CACTCTGATT 


300 


WcTCGTTCC 


CTTGTTTCTC 


ACCACCTCTT 


GGGTTGCCAT 


360 


OTGTTGATCC 


TCTCGTTGCC 


TCCTGGTCAC 


TG 


412 



(2) INFORMATION FOR SEQ \D NO: 5: 

(i) SEQUENCE CHARACTBRISTICS: 

(A) LENGTH: 1012\base pairs 

(B) TYPE: Nucleom.de 

(C) STRANDEDNESS : \Single 

(D) TOPOLOGY: Linel^r 

(ii) MOLCULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL/ ISOLATE 



luman 



(Xi) SEQUENCE DESCRIPTION: SEQ NO: 5: 
GGGGTCCTGG GCCCACCCGG GCAGGACGCG TGGaVcGAGT GACCGTGGTT TCTGTGTGGT 
GTCACCTGCC AGACCCGCCG AAGAAGCCAC CTCTtWg GGTGCGCTCT CTGGCACGCG 
CCACTCCCAC CCATCCGTGG GCCGCCAGCA CCACGcW CCCCCATCCA CATCGCGGCC 
ACCACGTCCC TGGGACACGC CTTGTCCCCC GGTGTAcJsCC GAGACCAAGC ACTTCCTCTA 
CTCCTCAGGC GACAAGGAGC AGCTGCGGCC CTCCTTCck CTCAGCTCTC TGAGGCCCAG 
CCTGACTGGC GCTCGGAGGC TCGTGGAGAC CATCTTTCt\ GGTTCCAGGC CCTGGATGCC 
AGGGACTCCC CGCAGGTTGC CCCGCCTGCC CCAGCGCTAcWaAATGC GGCCCCTGTT 

tctggagctg cttgggaacc acgcgcagtg cccctacggg y GCTCCTCA AGACGCACTG 
cccgctgcga gctgcggtca ccccagcagc cggtgtctgt gWgggaga AGCCCCAGGG 
ctctgtggcg gcccccgagg aggaggacac agacccccgt cgVctggtgc AGCTGCTCCG 

CCAGCACAGC AGCCCCTGGC AGGTGTACGG CTTCGTGCGG GCcWcCTGC GCCGGCTGGT 
GCCCCCAGGC CTCTGGGGCT CCAGGCACAA CGAACGCCGC TTCcWgA ACACCAAGAA 
GTTCATCTCC CTGGGGAAGC ATGCCAAGCT CTCGCTGCAG GAGCtLcGT GGAAGATGAG 

cgtgcgggac tgcgcttggc TGCGCAGGAG CCCAGGTGAG GAGGTG^TGG ccgtcgaggg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
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CCCAGGCCCC AGAGCTGART GCAGTAGGGG CTCAGAAAAG GGGGCAGGCA GAGCCCTGGT 
CCTCCTGTCT CCATCGTCMT GTGGGCACAC GTGGCTTTTC GCTCAGGACG TCGAGTGGAC 
ACGGTGATCT CTGCCTCTGCA TCTCCCTCCT GTCCAGTTTG CATAAACTTA CG 
(2) INFORMATION FOR SE© ID NO: 6: 

(i) SEQUENCE CHARAOTERISTICS : 

(A) LENGTH: 39^2 base pairs 

(B) TYPE: Nucl^tide 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: cDl 
(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 



900 
960 
1012 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL/ ISO] 



: Hiiman 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 6: 

GAATTCGCGG CCGCGTCGAC GTTTCAGGCA ctCCTGCGTC CTGCTGCGCA CGTGGGAAGC 60 

CCTGGCCCCG GCCACCCCCG CGATGCCGCG CGCTCCCCGC TGCCGAGCCG TGCGCTCCCT 120 

GCTGCGCAGC CACTACCGCG AGGTGCTGCC GCTKGCCACG TTCGTGCGGC GCCTGGGGCC 180 

CCAGGGCTGG CGGCTGGTGC AGCGCGGGGA CCCGSCGGCT TTCCGCGCGC TGGTGGCCCA 240 

GTGCCTGGTG TGCGTGCCCT GGGACGCACG GCCGCCCCCC GCCGCCCCCT CCTTCCGCCA 300 

GGTGTCCTGC CTGAAGGAGC TGGTGGCCCG AGTGCTSCAG AGGCTGTGCG AGCGCGGCGC 360 

GAAGAACGTG CTGGCCTTCG GCTTCGCGCT GCTGGACGGG GCCCGCGGGG GCCCCCCCGA 420 

GGCCTTCACC ACCAGCGTGC GCAGCTACCT GCCCAACAfcG GTGACCGACG CACTGCGGGG 480 

GAGCGGGGCG TGGGGGCTGC TGCTGCGCCG CGTGGGCGAC GACGTGCTGG TTCACCTGCT 540 

GGCACGCTGC GCGCTCTTTG TGCTGGTGGC TCCCAGCTGa GCCTACCAGG TGTGCGGGCC 600 

GCCGCTGTAC CAGCTCGGCG CTGCCACTCA GGCCCGGCCc\ CCGCCACACG CTAGTGGACC 660 

CCGAAGGCGT CTGGGATGCG AACGGGCCTG GAACCATAGC GTCAGGGAGG CCGGGGTCCC 720 

CCTGGGCCTG CCAGCCCCGG GTGCGAGGAG GCGCGGGGGC AGTGCCAGCC GAAGTCTGCC 7 80 

GTTGCCCAAG AGGCCCAGGC GTGGCGCTGC CCCTGAGCCG GAGCGGACGC CCGTTGGGCA 840 

GGGGTCCTGG GCCCACCCGG GCAGGACGCG TGGACCGAGT GACCGTGGTT TCTGTGTGGT 900 

GTCACCTGCC AGACCCGCCG AAGAAGCCAC CTCTTTGGAG GGTGCGCTCT CTGGCACGCG 960 

CCACTCCCAC CCATCCGTGG GCCGCCAGCA CCACGCGGGC CCCOCATCCA CATCGCGGCC 1020 
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4 



ACCACGTCCC TGGGACACffiC CTTGTCCCCC GGTGTACGCC GAGACCAAGC ACTTCCTCTA 
CTCCTCAGGC GACAAGGAGC AGCTGCGGCC CTCCTTCCTA CTCAGCTCTC TGAGGCCCAG 
CCTGACTGGC GCTCGGAGGCVTCGTGGAGAC CATCTTTCTG GGTTCCAGGC CCTGGATGCC 
AGGGACTCCC CGCAGGTTGC QCCGCCTGCC CCAGCGCTAC TGGCAAATGC GGCCCCTGTT 
TCTGGAGCTG CTTGGGAACC AQGCGCAGTG CCCCTACGGG GTGCTCCTCA AGACGCACTG 
CCCGCTGCGA GCTGCGGTCA CCQCAGCAGC CGGTGTCTGT GCCCGGGAGA AGCCCCAGGG 
CTCTGTGGCG GCCCCCGAGG AGgAgGACAC AGACCCCCGT CGCCTGGTGC AGCTGCTCCG 
CCAGCACAGC AGCCCCTGGC AGGTOTACGG CTTCGTGCGG GCCTGCCTGC GCCGGCTGGT 
GCCCCCAGGC CTCTGGGGCT CCAGGOACAA CGAACGCCGC TTCCTCAGGA ACACCAAGAA 
GTTCATCTCC CTGGGGAAGC ATGCCAASCT CTCGCTGCAG GAGCTGACGT GGAAGATGAG 
CGTGCGGGAC TGCGCTTGGC TGCGCAGG^ CCCAGGTGAG GAGGTGGTGG CCGTCGAGGG 
CCCAGGCCCC AGAGCTGAAT GCAGTAGGGgXc^CAGAAAAG GGGGCAGGCA GAGCCCTGGT 
CCTCCTGTCT CCATCGTCAC GTGGGCACAc/§zGGCTTTTC GCTCAGGACG TCGAGTGGAC 

acggtgatct ctgcctctgc tctccctcct gtccagtttg cataaactta cgaggttcac 

CTTCACGTTT TGATGGACAC GCGGTTTCCA GGCGCCGAGG CCAGAGCAGT GAACAGAGGA 
GGCTGGGCGC GGCAGTGGAG CCGGGTTGCC GGCAATGGGG AGAAGTGTCT GGAAGCACAG 
ACGCTCTGGC GAGGGTGCCT GCAGGGGTTG GCTGTGTTCC GGCCGCAGAG caccgtctgc 

gtgaggagat cctggccaag ttcctgcact ggctgatoag tgtgtacgtc gtcgagctgc 



TCAGGTCTTT CTTTTATGTC ACGGAGACCA CGTTTCi 



gaacaggctc tttttctacc 



ggaagagtgt ctggagcaag ttgcaaagca ttggaatcaffl acagcacttg aagagggtgc 
agctgcggga gctgtcggaa gcagaggtca ggcagcatcg\ggaagccagg cccgccctgc 
tgacgtccag actccgcttc atccccaagc ctgacgggct qcggccgatt gtgaacatgg 
actacgtcgt gggagccaga acgttccgca gagaaaagag gotggctgtg ctttggttta 
acttcctttt taaacagaag tgcgtttgag ccccacattt ggtatcagct tagatgaagg 
gcccggagga ggggccacgg gacacagcca gggccatggc acggsgccaa cccatttgtg 
cgcacggtga ggtggccgag gtgccggtgc ctccagaaaa gcagcatggg ggtgtagggg 
gagctcctgg ggcagggaca ggctctgagg accacaagaa gcagctgggc cagggcctgg 



ATGCAGCACG GCCCGAGCGG GTGGGGGCCC ACCACGCCAT TCTGGTC. 



GGTGTTGTAG 



TCGTAATAGC CGGCCCAGGC GCTCTGAACC TTCAGAGTCT CAAAAGCTGG GACCCTCAGG 
GCCAAATGGG GCCACACCTT GTCCTGGAAG AAATCATGGT CCACTTCCAGV GTTCGCCGGG 
TCCGGTTCTT CCTGCTCAGT GGGGCTACGA CCACCTAGGT AGTTGCTACC TAATCCTTCC 
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CGGCGAAAAT AGGCTCCACxT GGTGTCTGCA ACAAGCGGAG TCTCTAGGCC TGGTCCCTGG 2940 

GGGCAGTGCC ACACATACAQ ATACCTTTTC CTCGGCTCCA CAGGTAGCTT GGTGCCCTGC 3000 

AGGGTGCCAG GCGGCCCCTC \tCCAACACCA GCCAGTGCTG CGATTTGCGC AGACCAGGCT 3060 

CCGGCTGCGT TGATCACAAT C&GCGCATTCC ACAGGCTGGT ACTCCAGGCT GCGGTCCATC 3120 

TTCACATGGA CTTCATGGAT CaTTTTCAAG ACCACCGCTT TGTCATCTGT GGTCAACATG 3180 

CGTTGAGATG AAGAGACAAA ACqTGTCACC TCTCCCTGGC AGAAAAGGAC TCCCAAGGAC 3240 

TGGACCTTTC GCCGAAGCCC CTGQAGCAGA CACCAGGGGT CAAACCAACC TTCGTCCTCC 3300 

ATCCCATAAG ACGCCAAAGC CACTcVcTCT GTGTTTATCC AGGGAAACTT GTTCCGAAGC 33 60 

TGATCAGGAG ACATCAGAGA AACTTTGiGCT CCCTCCTGCC TCTGCACTTT CACGTTGCTC 3420 

TCCATGGCTG CAGCATCCTT TTCTGAAGCC AGCAAGAGGT AGCCCGAGGG GTTGAACCGG 3480 

AGGTCCAGGG GAGGAGCATC GACTACGGCt AGGTACTCAT TGATGTTCCG TAGAAAGCTG 3540 

GCTGAAAAGA GGGAGAGCTG GATGTTCTCaX GflCAATGAGA ACTGCTGACA AATCCCACCT 3600 

ACTGAGAGCC CAGTGGAGGC CTGTGAATACy«TGTGGTCCC GTTCCACCAC TAGCACTCGA 3660 

£ ATAGCACCTC GTCTGCTCTC CAGCTTCTTC AffiCCAATAGG CCACAGACAA GCCAAGCACC 3720 

f; CCACCTCCCA CGATCACCAC ATCCGAGTGC TCOBGGAGGCA GGTGGCTGGT GTCTTGCAGT 3780 

ft AGATCACAGG ACCTTCCAGG CAGGATCGAC TTGATCTTCT TCTTAATCTC AGACACCTTT 3840 

^ CCATCCCAGT CCAGAGAAAA GCCTCCTCTG CGCG'mSCCTG GCCTCCGGGT CAAGAGGCCC 3900 

Z CGGCCCATGC CGTGCGGCAG AACCCTCCGA ATCATi^CCC CTCTGAGCCC GGGTCGACGC 3960 

0 GGCCGCGAAT TC \ 3972 
g (2) INFORMATION FOR SEQ ID NO : 7 : \ 

(i) SEQUENCE CHARACTERISTICS: \ 

(A) LENGTH: 2 089 base pairs \ 

(B) TYPE: Nucleotide \ 

(C) STRANDEDNESS : Single \ 

(D) TOPOLOGY: Linear \ 

(ii) MOLECULE TYPE: cDNA \ 

(v) FRAGMENT TYPE: Linear \ 

(vi) ORIGINAL SOURCE: \ 
(C) INDIVIDUAL/ ISOLATE: Human \ 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: \ 

CCGGAAGAGT GTCTGGAGCA AGTTGCAAAG CATTGGAATC AGACAGCACT TGAAGAGGGT 60 

GCAGCTGCGG GAGCTGTCGG AAGCAGAGGT CAGGCAGCAT CGGGAAGCCA GGCCCGCCCT 12 0 

GCTGACGTCC AGACTCCGCT TCATCCCCAA GCCTGACGGG CTGCGGCCQA TTGTGAACAT 180 
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GGACTACGTC GTGGGAGCCA 



lCGTTCCG CAGAGAAAAG AGGGCCGAGC GTCTCACCTC 



GAGGGTGAAG GCACTGTTCA GOGTGCTCAA CTACGAGCGG GCGCGGCGCC CCGGCCTCCT 
GGGCGCCTCT GTGCTGGGCC TGCmCGATAT CCACAGGGCC TGGCGCACCT TCGTGCTGCG 

tgtgcgggcc caggacccgc cgccVgagct gtactttgtc aaggtggatg tgacgggcgc 

GTACGACACC ATCCCCCAGG ACAGGSTCAC GGAGGTCATC GCCAGCATCA TCAAACCCCA 
GAACACGTAC TGCGTGCGTC GGTATGSCGT GGTCCAGAAG GCCGCCCATG GGCACGTCCG 
CAAGGCCTTC AAGAGCCACG TCTCTAC^TT GACAGACCTC CAGCCGTACA tgcgacagtt 

cgtggctcac ctgcaggaga ccagcccgct gaggggtgcc gtcgtcatcg agcagagctc 

CTCCCTGAAT GAGGCCAGCA GTGGCCTCTY CGACGTCTTC CTACGCTTCA TGTGCCACCA 
CGCCGTGCGC ATCAGGGGCA AGTCCTACG'A CCAGTGCCAG GGGATCCCGC AGGGCTCCAT 
CCTCTCCACG CTGCTCTGCA GCCTGTGCTA CGGCGACATG GAGAACAAGC TGTTTGCGGG 



GATTCGGCGG GACGGGCTGC TCCTGCGTTT 



feGATGAT TTCTTGTTGG TGACACCTCA 



CCTCACCCAC GCGAAAACCT TCCTCAGGAC CCTGGTCCGA GGTGTCCCTG AGTATGGCTG 
CGTGGTGAAC TTGCGGAAGA CAGTGGTGAA CTTCCCTGTA GAAGACGAGG CCCTGGGTGG 
CACGGCTTTT GTTCAGATGC CGGCCCACGG CCTaVtCCCC TGGTGCGGCC TGCTGCTGGA 
TACCCGGACC CTGGAGGTGC AGAGCGACTA CTCCAGCTAT GCCCGGACCT CCATCAGAGC 

cagtctcacc ttcaaccgcg gcttcaaggc tgggaogaac atgcgtcgca aactctttgg 
ggtcttgcgg ctgaagtgtc acagcctgtt tctgga'sttg caggtgaaca gcctccagac 
ggtgtgcacc aacatctaca agatcctcct GCTGCAGGCG tacaggtttc acgcatgcgt 
gctgcagctc ccatttcatc agcaagtttg gaagaacccc acatttttcc tgcgcgtcat 

CTCTGACACG GCCTCCCTCT GCTACTCCAT CCTGAAAGct AAGAACGCAG GTATGTGCAG 
GTGCCTGGCC TCAGTGGCAG CAGTGCCTGC CTGCTGGTG'A TAGTGTGTCA GGAGACTGAG 
TGAATCTGGG CTTAGGAAGT TCTTACCCCT TTTCGCATCa\gGAAGTGGTT TAACCCAACC 
ACTGTCAGGC TCGTCTGCCC GCCCTCTCGT GGGGTGAGCA GAGCACCTGA TGGAAGGGAC 
AGGAGCTGTC TGGGAGCTGC CATCCTTCCC ACCTTGCTCT QCCTGGGGAA GCGCTGGGGG 
GCCTGGTCTC TCCTGTTTGC CCCATGGTGG GATTTGGGGG GqCTGGCCTC TCCTGTTTGC 
CCTGTGGTGG GATTGGGCTG TCTCCCGTCC ATGGCACTTA GGGCCCTTGT gcaaacccag 

gccaagggct taggaggagg ccaggcccag gctaccccac ccctctcagg agcagaggcc 
gcgtatcacc acgacagagc cccgcgccgt cctctgcttc ccagtcaccg tcgtctgccc 
ctggacactt tgtccagcat cagggaggtt tctgatccgt ctgaaattca agccatgtcg 

AACCTGCGGT CCTGAGCTTA ACAGCTTCTA CTTTCTGTTC TTTCTGTGTT GTGGAGACCC 
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TGAGAAGGAC CCTGGGAGCT CTGGGAATTT JbGAGTGACCA AAGGTGTGC 2089 



: 3 




